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AGROBACTERIUM TUMEFACIENS RPOA GENE 

This application is a continuation-in-part of provisional application Serial No. 
60/129,682 filed on April 16, 1999 and a continuation-in-part of provisional application 
Serial No. 60/134,206 filed on May 14, 1999, both of which are incorporated herein in 

0 their entirety by reference. 

This work was supported in part by NSF grant MCB-9722227. The U.S. 
Government may have certain rights to the present invention. 

The present invention relates to nucleic acid molecules and constructs containing 
these nucleic acid molecules and methods of using these constructs to express heterologous 

5 genes in hosts. In particular, the nucleic acid molecules of the present invention comprise a 
DNA sequence encoding at least a portion of the a subunit of the RNA polymerase 
(referred to herein as RNAP) obtained from the same genus source from which the 
heterologous genes were originally isolated. The nucleic acid constructs also can optionally 
comprise, if required for expression of the heterologous genes, at least one gene encoding a 

0 transcriptional regulator, also obtained from the same genus as the source of the 
heterologous genes. 

BACKGROUND OF THE INVENTION 
Although many host cell systems are well characterized and utilized to express a 

.5 number of heterologous genes, there are still many genes that cannot be expressed in such 
host cell systems, as for example, Escherichia coli (£. coli). It is believed that the lack of 
expression in these host cell systems is a result of the promoter sequences of the 
heterologous genes not being recognized by the host RNAP. The present invention has 
solved this problem by showing that this block in expression can be overcome by co- 

i0 expression of at least a portion of the rpoA gene product, the gene that encodes the a 

subunit of the RNAP, obtained from the same genus as the source of the heterologous gene 
that is desired to be expressed in the new host cell. The co-expressed a subunit of the 
RNAP from the same genus as the source of the heterologous gene combines with the other 
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subunits of the RNAP, p, p', and a, to form a functional RNAP. Additionally, if 
transcriptional regulators, such as transcriptional activators or transcriptional repressors, or 
additional other subunits of the RNAP, such as p, P' or a, are required to obtain expression 
of the heterologous gene, then these additional components also are obtained from the same 
5 genus as the source of the heterologous gene. Since two different rpoA genes are present in 
a single host cell, there will be various combinations of RNAPs present in the host cell (1) 
RNAP containing two a subunits of the same genus as the host (2) RNAP containing two a 
subunits from the same genus source as the heterologous gene, and (3) RNAP containing 
one a subunit from the same genus as the host and one a subunit from the same genus 

10 source as the heterologous gene. A heterologous gene is intended to mean a gene that is 
not from the same source as the host cell. 

Agrobacterium tumefaciens is a Gram-negative soil bacterium which is the causative 
agent of Crown Gall disease, affecting primarily dicotyledonous plant species (reviewed in 
18, 62). The pathogen incites production of the characteristic tumor through the transfer of 

15 a piece of DNA (T-DNA) from the Ti (Tumor inducing) plasmid into susceptible plant 
cells, with subsequent integration into the host genome. The T-DNA contains genes that 
direct the biosynthesis of auxin and cytokinin in infected cells (1, 57), resulting in 
uncontrolled cell division leading to production of the characteristic tumor. The T-DNA 
also contains genes for the biosynthesis of unique compounds called opines which the 

20 bacterium can utilize as a carbon and nitrogen source (39). 

Successful transfer of the T-DNA is dependent on the coordinated expression of 
virulence (vir) genes located on the Ti plasmid but separate from the T-DNA. Expression 
of vir genes occurs in response to certain phenolic compounds released from wounded 
plants (54). This expression is augmented by monosaccharides (5, 52), and an acidic pH 

25 (38) which are characteristics of plant wound sites. Expression of vir genes requires vir A 
and virG, which are members of the family of two component regulatory systems (60). 
VirA is an inner membrane associated histidine protein kinase which autophosphorylates in 
response to the environmental signals (19, 28). The phosphate moiety is subsequently 
transferred to the aspartate residue of VirG, which in turn activates transcription from 

30 promoters containing a specific 12 base pair sequence called the vir box, present in the 
promoters of all vir genes (29, 44). In addition to vir A and virG, other chromosomally 

2 
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encoded genes have been identified in A. tumefaciens that have been shown to modulate 
virulence gene expression either directly or indirectly (12, 15, 20, 61). 

The use of E. coil as a heterologous host in which to study the regulation of A. 
tumefaciens virulence genes and the mechanism of T-DNA transfer constitutes an ideal 
5 model system given the degree of characterization at both the biochemical and genetic level. 
However, all previous attempts to reconstitute vir gene expression in E. coil have not been 
successful. Possible explanations for the lack of vir gene expression include the presence of 
unidentified regulatory genes in A. tumefaciens required for vir induction, and/or that E. 
coli may contain specific repressor(s) of vir gene induction. 

10 A characteristic of vir gene promoters is the absence of a strong -35 sequence (10). 

Dnase I footprinting studies have shown that VirG protects a region extending into where 
the -35 consensus sequence should be (29, 44). It has been suggested that binding of VirG 
may functionally replace the -35 consensus sequence allowing transcription to occur. This 
situation is similar to Class II CAP-dependent promoters, in which the CAP binding site 

15 overlaps with the -35 sequence. Studies have demonstrated that transcription at Class II 
CAP-dependent promoters requires interaction between CAP and the a subunit (rpoA) of 
RNAP (42, 49, 64). Many transcriptional factors from E. coli are known to require 
interaction with the a subunit of RNAP , including FNR (59), GaiR (9), Mar A (24), Mer R 
(7), MetR (23), OxyR (56), OmpR (21), Rob (26), SoxS (25), and TyrR (34). Analysis of 

20 the a subunit from E. coli indicates the presence of two independent domains, the N- 

terminal domain and the C-terminal domain (21, 27, 63). The N-tenninal domain (NTD) is 
involved in the assembly of the core polymerase, while the C-terminal domain (CTD) is 
involved in interaction with certain transcriptional regulators (7, 9, 21, 23, 24, 25, 26, 32, 
34, 47, 56, 59). Recently, interaction between the NTD and CAP at Class II CAP- 

25 dependent promoters has been demonstrated (42,49). 

The present methods of expressing heterologous genes are particularly useful to 
express multiple genes or operons in a metabolic pathway by co-expressing the rpoA gene 
product obtained from the same natural hosts from which the genes or operons were 
originally isolated with the multiple genes. The present method provides an advantage over 

30 existing multiple gene expression systems by eliminating the need to separately isolate each 
gene in the metabolic pathyway and link each gene to a promoter that is functional in the 
host. Expression of heterologous genes for an entire metabolic pathway in hosts is 
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particularly useful to produce particular metabolites, such as pharmaceuticals, food 
supplements, chemical products or fine chemical products. 

The present invention further provides nucleic acid molecules and methods whereby 
heterologous genes encoding metabolites in an entire metabolic pathway can be introduced 

5 into hosts so that these hosts possess specific catabolic and/or metabolic characteristics that 
allow the hosts to grow or express gene products under specific environmental conditions. 
These hosts are useful in bioremediation, bioassays and biocontrol studies where 
manipulation of natural or symbiotic bacterial flora is desired, such as in the control of 
parasites or insects where bacterial symbiants are involved. 

10 The present invention broadly relates to the expression of heterologous genes in a 

host using nucleic acid molecules comprising a gene encoding the complete a subunit of the 
RNAP obtained from the same genus as the source of the heterologous genes or at least a 
portion of the a subunit of the RNAP. The host and the heterologous genes to be expressed 
may be prokaryotic or eukaryotic in origin, however, prokaryotic hosts are preferred. The 

15 present invention also relates to a hybrid nucleic acid molecule encoding a hybrid a subunit 
of RNAP, where a first nucleic acid comprises at least a portion of the rpoA gene encoding 
the a subunit of an RNAP obtained from the same genus as the source of the host cell and a 
second nucleic acid comprises at least a portion of the a subunit of an RNAP obtained from 
the same genus as the heterologous gene. 

20 The present invention specifically discloses the rpoA gene from Agrobacterium 

encoding the a subunit of the RNAP and the corresponding amino acid sequence. Portions 
of both the rpoA gene and the a subunit of RNAP of Agrobacterium also are encompassed 
by the present invention, particularly the sequences disclosed in Figs. 2 A and 2B. The 
rpoA gene is useful in methods of expressing Agrobacterium genes in hosts, particularly 

25 prokaryotic host cells. Prior to the present invention, it was not possible to express 

Agrobacterium genes, particularly Agrobacterium virulence genes, in E. coli. Therefore, 
the present invention provides a method for studying Agrobacterium genes and their 
regulation. 

The invention also relates to genes encoding hybrid rpoA genes that comprise at 
30 least a portion of an rpoA gene from the same genus as a heterologous gene which is 
expressed, such as the Agrobacterium rpoA gene, with the remainder of the rpoA gene 
obtained from the same genus as the host cell in which the heterologous gene is expressed. 
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SUMMARY OF THE INVENTION 

The present invention is directed to an isolated nucleic acid molecule encoding the 
complete a subunit of an RN AP of Agrobacterium or at least a portion thereof. 
Specifically, the present invention is directed to an isolated nucleic acid molecule encoding 
5 the complete a subunit of an RNAP of Agrobacterium, in which the amino acid sequence of 
the a subunit is depicted in Figure 2 A or at least a portion of this sequence. More 
specifically, the present invention is directed to an isolated nucleic acid molecule 
comprising the complete nucleic acid sequence as depicted in Figure 2B or at least a portion 
thereof. These isolated nucleic acid molecules encoding at least a portion of the a subunit 

10 of an RNAP of Agrobacterium also are optionally linked to a promoter that is functional in 
the host in which these molecules are expressed. Further, the present invention is directed 
to vectors containing these nucleic acid sequences or molecules. The vectors are useful in 
transforming host cells for the expression of heterologous gene(s). The invention is also 
directed to host cells that are transformed with the nucleic acid molecules described herein. 

15 The nucleic acid molecules useful to transform the host cells are operably linked to a 
promoter and encode an a subunit of an RNAP or at least one heterologous protein. 

The present invention is also directed to a hybrid nucleic acid molecule for the 
expression of at least one heterologous gene in a host cell comprising a first nucleic acid 
sequence encoding at least a portion of the a subunit of an RNAP obtained from the same 

20 genus as the host cell and a second nucleic acid sequence encoding at least a portion of the 
a subunit of an RNAP obtained from the same genus as the source of the heterologous 
gene; and vectors containing the hybrid nucleic acid molecule. 

The present invention is further directed to a method of expressing at least one 
heterologous gene in a host cell comprising transforming a host cell with a vector 

25 comprising a nucleic acid molecule encoding a complete a subunit of RNAP from the same 
genus as the source of the heterologous gene or at least a portion thereof, and also 
transforming the host cell with a vector comprising at least one heterolgous gene. The 
nucleic acid molecule and the heterologous gene(s) also may be contained in one vector but 
preferably these sequences are contained within two or more vectors. In a further 

30 embodiment, the present invention is also directed to a method of expressing at least one 
heterologous gene in a host cell comprising transforming a host cell with a vector 
comprising a hybrid nucleic acid molecule comprising a first nucleic acid sequence 
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encoding at least a portion of the a subunit of an RNAP obtained from the same genus as 
the host cell and a second nucleic acid sequence encoding at least a portion of the a subunit 
of an RNAP obtained from the same genus as the source of the heterologous gene, and 
transforming the host cell with a vector comprising at least one heterologous gene; and 
5 culturing the transformed host cell under conditions where the heterologous gene is 
expressed in the host cell. The host cell also may be transformed with a single vector 
containing the nucleic acid molecule encoding the complete a subunit of the RNAP, a 
portion thereof or a hybrid molecule comprising a portion of the a subunit of the RNAP, 
and with the heterolgous gene(s) or a subunit of the RNAP peferably, two or more vectors 
10 may be used. The nucleic acid molecules encoding the a subunit and the heterologous 
gene(s) may be in separate vectors for transformation of the host cell. The host cell can 
optionally be transformed with at least one gene encoding a heterologous transcriptional 
regulator obtained from the same genus as the source of the heterologous gene. The 
heterologous gene may comprise multiple genes or operons in the same metabolic pathway. 

15 

RRIRF DESCRIPTION OF THE DRAWINGS 

Figure 1 . Physical and genetic map of recombinant plasmids carrying the rpoA 
gene from A. tumefaciens is depicted. Abbreviations: B, BamHI; E. FcoRI; P, PstI; Sa, 
Sm3A; St, Sttd, X, Xhol Location and direction of rpoA and lac promoter for constructs 
20 pPS1.3R and pPS1.3 are indicated by arrows. 

Figure 2A. The amino acid sequence of the A. tumefaciens a subunit of the RNAP 
is depicted. 

Figure 2B. The A. tumefaciens rpoA gene encoding the A. tumefaciens a subunit 
of the RNAP is depicted. The actual coding sequence for the a subunit is located between 
25 nucleotide 355 (atg) and nucleotide 1356 (taa). 

Figure 3. Amino acid sequence alignment of the A. tumefaciens and E. coli rpoA 
gene products. Identical amino acid residues at a given position are marked by an asterisk, 
and conserved substitutions are marked by a dot. Gaps introduced to allow optimal 
alignment are designated by a dash. The bolded nucleotides show three highly conserved 
30 regions between the two amino acid sequences. 

Figure 4 A and B. Assembly of His-rpoA into RNAP, and in vitro transcription 
assays. Figure 4 A - SDS-PAGE analysis of reconstituted RNAP containing £. coli (3, p\ 
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a 70 subunits and His-RpoA from E. coli or A. tumefaciens and stained with Coomassie 
brilliant blueB. Figure 4B - In vitro transcription from virB and galPl promoters in the 
absence and presence of VirG con . The figure shows the result of a multiround transcription 
reaction employing reconstituted RNAP containing His-rpoA from either E. coli or A. 
5 tumefaciens. Where indicated, 2 pmol of VirG"" 1 was added to the reaction. 

Figure 5. Protein-protein and protein-DNA interactions between RpoA, VirG con and 
the virB promoter. The figures show electrophoretic mobility shift assays of [ 32 P]-labeled 
virB promoter containing purified VirG 0011 and His-RpoA from E. coli in Figure 5 A or A. 
tumefaciens in Figure 5B Lane 1: Promoter only, Lane 2: Promoter + ViiG™, Lane 3: 
10 Promoter + 150 nM RpoA, Lane 4: Promoter + 300 nM RpoA, Lane 5: Promoter + 
600 nM RpoA, Lane 6: Promoter + VirG^" + 150 nM RpoA, Lane 7: Promoter + 
VirG™ + 300 nM RpoA, Lane 8: Promoter + VirG"" 1 + 600 nM RpoA. 

DETAILED DESCRIPTION OF THE PREFERR ED EMBODIMENT 

15 Although many well characterized host cells, such as E. coli, provide advantages as 

an expression host cell, it has not been possible to express the genes of many bacteria in 
these host cells because the promoter sequences of the heterologous genes are not 
recognized by the host cell RNAP. The present invention provides a method of 
overcoming this block to expression by co-expressing the a subunit of the RNAP from the 

20 same genus as the source of the heterologous genes that are to be expressed in the host.and 
optionally co-expressing the transcriptional activator from the same genus as the source of 
the heterologous genes. The success of this method has been demonstrated by showing the 
expression of the Agrobacterium tumefaciens virulence (vz'r) gene operons in E. coli. In 
this specific example, expression of these genes was shown to require not only the vir gene 

25 specific transcriptional regulator but also the rpoA gene from A. tumefaciens. The A. 

tumefaciens a subunit of the RNAP is able to assemble with the other subunits (P, P' and a 
subunits) of the E. coli RNAP to form a functional polymerase which specifically 
recognizes the Agrobacterium genes in the presence of the Agrobacterium transcriptional 
regulator, specifically an activator. The results show that this interaction is genus specific, 

30 in that, the transcriptional activator from Agrobacterium can only efficiently interact with 
the rpoA gene product from Agrobacterium and not with that of E. coli. 



7 
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Conventional understanding of gene expression in prokaryotes emphasizes the 
importance of the transcriptional regulator, either an activator or a repressor. A 
transcriptional activator is a protein that is needed to turn on the expression of target genes 
either directly or indirectly whereas repressors are proteins that inhibit the transcription of 
5 target genes either directly or indirectly. In the present invention, it is shown that the a 
subunit of the RNAP is the most critical factor in activating bacterial genes, especially 
those requiring transcriptional regulators. For example, to express heterologous genes or 
operons from organism A in organism B, at least a portion of the rpoA gene from organism 
A is also needed to be co-expressed in organism B. Likewise if transcriptional regulators 

10 are required for expression of heterologous genes, the genes encoding the transcriptional 
regulator(s) from organism A should also be used to transform the host because these 
regulators can most efficiently interact with RpoA from the same organism A but not as 
well or at all with that of organism B. 

The present method is particularly advantageous when it is used to express 

15 particular heterologous metabolic pathway genes in hosts for bioremediation/biocontrol 
purposes or production of particular metabolites. This approach becomes much more 
practical than fusing each gene or operon under the control of a known promoter that is 
functional in the host. 

Expression of genes for an entire metabolic pathway in heterologous hosts is useful 

20 to produce particular metabolites or to confer to the host specific catabolic or metabolic 
characteristics that now enables the host to grow or express gene products under specific 
environmental conditions. 

An example of the utility of the present method is shown by the production of a 
plant growth promoting molecule, indole-3-acetic acid, from Azospirillum brasilense, that 

25 requires at least 3 biosynthetic genes. A. brasilense also possesses multiple genes that 

encode products that fix nitrogen (65). The multiple genes responsible for encoding these 
products and the rpoA gene are introduced via one or more vehicles, such as plasmids or 
cosmids, into a host that is applied to a plant and that expresses the products of the multiple 
genes. The expressed plant growth promoting molecules and nitrogen fixing molecules 

30 affect the growth of the treated plant. 

Further Rhizobium, Brady rhizobium, Azorhiobium and Sinorhizobium are other 
examples of nitrogen-fixing bacteria, each involving at least two dozen genes for nodulation 
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(symbiosis) and multiple genes for nitrogen fixation. These bacteria, as well as other 
bacteria that produce useful products for plants, may also be the source of the heterologous 
genes and the rpoA gene. 

The present invention can also be utilized to express gene product(s) in a catabolic 
5 pathway that degrades or breaks down toxic or environmentally hazardous products. For 
example, a Pseudomonas species strain ADP expresses at least four genes that are involved 
in the degradation of a herbicide, Atrazine (66). These genes are introduced and expressed 
in a host by applying the host to areas where this herbicide is present, resulting in the 
degradation of the herbicide on site. 
10 Further, the present invention is useful in providing a biocontrol system that 

includes a host that expresses heterologous gene products, that for example, kill disease 
producing organisms on a plant, animal, human or a surface that will support growth of the 
host. Pseudomonas chloroaphis strain 30-84 expresses at least three antifungal metabolites, 
phenazine-l-carboxylate, 2-hydroxyphenazine-l-carboxlylic acid and 2-hydroxyphenazine 
15 that involves four biosynthetic genes and 2 regulatory genes (67). These genes may be 

introduced into a plant bacteria, such as Rhizobium, that can be applied to a plant to control 
the growth of fungi on the plant. 

The present invention also is useful as an A. tumefaciens gene delivery system in 
heterologous hosts for potential gene delivery or gene therapy systems of A. tumefaciens 
20 genes to various hosts. 

The present invention also can be used to express heterologous genes in A. 
tumefaciens to alter its characteristics (e.g. host specificity) for a broader application of the 
A. tumefaciens mediated gene delivery system. 

The present invention also is useful to prepare host cells containing an rpoA gene 
25 that is heterologous to the host cell, such as Escherichia. This host cell is useful for 
studying changes in gene expression patterns in this cell. 

The present invention can additionally be used to express particular sets of 
heterologous genes, including multigene operons/stimulons or genes encoding multi-subunit 
protein complexes, such as proteosomes or gene libraries in hosts as targets for drug 
30 screening and/or diagnostics. 

The present invention discloses an isolated nucleic acid molecule encoding the 
complete a subunit of an RN AP of Agrobacterium or at least a portion thereof. More 
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specifically, an isolated nucleic acid molecule encoding the complete a subunit of an RNAP 
of Agrobacterium, where the amino acid sequence of the a subunit is depicted in Figure 2 A 
is disclosed or at least a portion thereof. The portion can be a length of amino acid 
sequences that is sufficient to express any heterologous gene in the host cell into which it is 

5 introduced. The portion of the a subunit of the RNAP also should be of a sufficient length 
to provide at least one of the following functions: 1) bind to DNA in the promoter region 
2) interacts with the other subunits of RNAP; i.e., p, p' and a to form a functional RNAP, 
and 3) interacts with the transcriptional activator. More specifically, an isolated nucleic 
acid molecule comprising at least a portion of the nucleic acid sequence as depicted in 

10 Figure 2B and that encodes at least a portion of the amino acid sequence of Figure 2 A is 
disclosed. The nucleic acid molecule (rpoA) encoding the Agrobacterium a subunit of the 
RNAP or a portion of the nucleic acid' molecule is useful in the expression of 
Agrobacterium genes in any prokaryotic host cells. 

The present invention is also directed to a nucleic acid molecule encoding a hybrid 

15 a subunit of RNAP where the isolated nucleic acid molecule is inserted in a vector and is 
useful in expressing at least one heterologous gene in a host cell comprising a first nucleic 
acid sequence encoding at least a portion of the a subunit of an RNAP obtained from the 
same genus as the host cell and a second nucleic acid sequence encoding at least a portion 
of the a subunit of an RNAP obtained from the same genus as the source of the 

20 heterologous gene. This isolated nucleic acid molecule is operably linked to a promoter so 
that it is under the control of a promoter that is functional in the host cell. For example, 
the E. coli lac or trp promoters are useful to express the rpoA gene in an E. coli host cell. 
Promoters that function in numerous host cells are well known to persons skilled in the art 
and can be selected based upon the host cell selected. The second nucleic acid sequence is 

25 obtained from any eukaryotic or prokaryotic host cell, and more particularly from a 
prokaryotic cell, including Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. More 
specifically, the bacteria are selected from the genus Agrobacterium, Mycobacterium, 
Chlamydia, Pseudomonas or Streptomyces . More preferably, it is obtained from 

30 Agrobacterium, and most preferably the second nucleic acid sequence encodes the complete 
amino acid sequence as depicted in Figure 2A or a portion thereof, with the sufficient 
length of this portion discussed above. The portion of the of the rpoA gene can be from 
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any position in the gene; i.e., that is from the 5' or the 3' end of the gene, which encodes 
the N-terminus and C-terminus of the a-subunit, respectively. Most preferably, the nucleic 
acid molecule portion encodes at least amino acid residues 157 to 336 as depicted in Figure 
2A. The nucleic acid molecule can encode as few as 8 consecutive amino acid residues of 
5 any portion of Figure 2A. Specifically, a portion comprising at least the amino acid 
residues 157 to 336, or of less than amino acid residues of 157 to 336, and more 
specifically amino acid residues 329 to 336 are useful in the present invention. The first 
nucleic acid sequence is obtained from any bacteria but more preferably from the genus 
Escherichia, Pseudomonas , Bacillus, Lactobacillus, Clostridium or Rhizobium. 

10 The present invention is directed to a method of expressing at least one heterologous 

gene in a host cell comprising transforming a host cell with a nucleic acid construct 
comprising a first nucleic acid sequence encoding at least a portion of the a subunit of an 
RNAP obtained from the same genus as the source of the host cell, and a second nucleic 
acid sequence encoding at least a portion of the a subunit of an RNAP obtained from the 

15 same genus as the source of the heterologous gene; and with at least one heterologous gene; 
and culturing the transformed host cell under conditions wherein the heterologous gene is 
expressed in the host cell. The method further comprising transforming the host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. The heterologous gene may comprise 

20 multiple genes or operons in the same metabolic pathway that catalyze the production of a 
product. 

The host cells useful in the present method are prokaryotic cells or eukaryotic cells. 
Preferably, the host cell is selected from a prokaryotic cell selected from the group 
consisting of Gram-negative bacteria, Gram-positive bacteria, aerobic bacteria, acid-fast 

25 bacteria, mycoplasma, anaerobic bacteria, and facultative bacteria. Preferably, the host cell 
is obtained from the genus Escherichia, Pseudomonas, Bacillus, Lactobacillus, Clostridium 
or Rhizobium. Eukaryotic cells may be selected from known mammalian cells that can be 
grown in cell culture, such as CHO, BHK, COS. 

The present invention also is directed to a method of expressing at least one 

30 heterologous gene in a host cell comprising transforming the host cell with a vector 

comprising a nucleic acid molecule encoding a complete a subunit of an RNA polymerase 
obtained from the same genus as the heterologous gene or a portion of the a subunit of the 

11 
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RNA polymerase, and also transforming the host cell with at least one vector comprising at 
least one heterologous gene; and then culturing the transformed host cell under conditions 
where at least one of the heterologous genes is expressed in the host cell. The method 
further comprises transforming the host cell with at least one gene encoding a heterologous 

5 transcriptional regulator obtained from the same genus as the source of the heterologous 
gene. The transcirptional regulator is a transcriptional activator that interacts with the a 
subunit of the RNAP to enhance the expression of the heterologous gene. The heterologous 
gene may comprise multiple genes or operons in the same metabolic pathway. 

Preferably, the host cell is transformed with two vectors, one containing the rpoA 

10 gene or a portion thereof and one containing at least one heterologous gene. However, the 
host cell may be transformed with a single vector containing the rpoA and heterologous 
genes, but more than two vectors can be used to introduce the genes into the host cell 
depending on the number of genes and the size of the nucleic acid to be introduced. 

The host cell may be a prokaryotic cell or a eukaryotic cell. Preferably the host cell 

15 is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, Gram- 
positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and 
facultative bacteria. More preferably the host cell is from the genus Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. Most preferably the host 
cell is an Escherichia. 

20 The method is useful for the expression of at least one heterologous gene, where the 

genus of the source of the heterolgous gene is a prokaryotic cell selected from the group 
consisting of Gram-negative bacteria, Gram-positive bacteria, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria. More preferably the genus of the source of the 
heterologous gene is Agrobacterium, Mycobacterium, Chlamydia, Pseudomonas or 

25 Streptomyces, and most preferably, the genus is obtained from Agrobacterium. 

The method utilizes a nucleic acid molecule that encodes the a subunit of the RNAP 
the amino acid sequence as depicted in Figure 2 A or at least a portion of the amino acid 
sequence depicted in Figure 2A. The portion of the amino acid sequence useful in the 
present method may be from the N-terminus or C-terminus of the a subunit of the RNAP 

30 but preferably comprises at least amino acid residues 157 to 336 of the amino acid sequence 
depicted in Figure 2A or the portion of the amino acid sequence may comprise less than 
amino acid residues 157 to 336 of the amino acid sequence depicted in Figure 2A. The 
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portion of the amino acid sequence should comprises at least 8 consecutive amino acid 
residues of Figure 2A. More specifically, the portion of the amino acid sequence as 
depicted in Figure 2A comprises amino acid residues 329 to 336. 

The present invention also encompases a purified a subunit of an RNAP of 
5 Agrobacterium. More specifically, the purified a subunit of an RNAP of Agrobacterium of 
the present invention is the amino acid sequence depicted in Figure 2A. The invention is 
also intended to encompass portions of the purified a subunit of an RNAP disclosed in 
Figure 2A. More specifically, the purified a subunit of an RNA polymerase of 
Agrobacterium or a portion thereof is encoded by the nucleic acid is depicted in Figure 2B 

10 or a portion thereof. 

A rpoA gene from the same genus as the source of a heterologous gene encoding the 
a subunit from this source is obtained by methods well known to persons skilled in the art. 
For example, this gene can be obtained: (1) by complementation of the a temperature 
sensitive rpoA mutant strain of a known organism, i.e. E.coli as discussed below in the 

15 preferred embodiment; (2) by Southern hybridization based on DNA sequence homology; 
(3) by Western blot based on cross reactivity of an antibody against a known RNAP a 
subunit; and (4) by PCR amplification based on conserved DNA sequences among known 
rpoA genes. For their expression, the rpoA genes are fused behind or operably linked to a 
promoter that is known to be operable in the host. The fusion is either a transcriptional 

20 fusion or translational fusion. Many standard methods are well known to persons skilled in 
the art and can be utilized to carryout these methods (See reference 68 which is herein 
incorporated in its entirety by reference.) 

The following specific examples set forth below serve to further illustrate the 
present invention in its preferred embodiments, and are not intended to limit the present 

25 invention to these examples. 

EXAMPLES 

Bacterial strains, plasmids and media . 

All strains and plasmids used or constructed are listed in Table 1 . Bacterial strains 
30 were grown in either LB medium (40), Mannitol Glutamate Luria salts (MG/L) medium 
(58), or Induction medium containing 1 % glucose (61) at 28°C. Induction medium was 
used for attempts to reconstitute wild type vir gene induction, while MG/L and LB medium 
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were utilized for strains containing virG™ (i.e. pSY215 and pLG2). When appropriate, 
media was supplemented with ampicillin (100 ug/ml), gentamycin (20 ug/ml), kanamycin 
(50 p,g/ml) and tetracycline (20 ug/ml) for E. coli and carbenicillin (100 ug/ml), 
gentamycin (100 ug/ml), kanamycin (100 ug/ml) and tetracycline (5 ug/ml) for A. 

5 tumefaciens. For determinations of p-galactosidase activity, 5-bromo-4-chloro-3-indoly-(3- 
D-galactopyranoside (X-Gal) and isopropylthio-p-galactoside (IPIG) were included at a 
final concentration of 75 ug/ml and 1 mM, respectively. Acetosyringone (Sigma) and 
glucose was included, when necessary at 200 uM and 1 %, respectively. For induction 
assays, E. coli stain MC4100 containing constructs indicated in Table 1 were grown to 

10 stationary phase in 5 ml of appropriate medium, containing antibiotics as required. These 
overnight cultures were used to inoculate 125 ml flasks containing 30 ml of identical 
medium using 0.5 ml as an inoculum. The cultures were incubated at 28°C with shaking 
for 16 hours and assayed for p-galactosidase activity according to the method of Miller 
(40). For reconstitution of virulence gene expression in E. coil, plasmid constructs 

15 pSY204, pLG2, pGP159and pSL107 were introduced into MC4100 by electroporation. 

Construct pH098 containing lac-driven A. tumefaciens rpoA was then introduced into these 
strains and initially screened on LB medium (pSY204 and pLG2) or induction medium 
(pGP159and pSL107) containing appropriate antibiotics, 1 mM IPTG, 75 mg/mi X-Gal and 
200 uM acetosyringone. Induction assays for these strains were carried out as described 

20 above. 

Isolation and subclo ninp of r poA locus 

A cosmid clone library of A. tumefaciens strain A136, a derivative of strain C58 
lacking the Ti plasmid, in pVK102 was described previously (8). The cosmid clones were 
transformed into E. coli strain MC4100 containing plasmid pSY215. Transformants were 

25 initially screened for the development of a blue color on induction media containing X-Gal, 
indicating expression of the virBp::lacZ fusion. A cosmid clone, designated pBK2, was 
isolated by this procedure and was used for subsequent subcloning attempts, as outlined in 
Figure 1. 

DNA sequencing was performed by PCR mediated Taq DyeDeoxy Terminator 
30 Cycle sequence on an Applied Biosystems model 377 DNA sequencer. The GCG sequence 
analysis software package (Genetics Computer Group, Madison, WI) and the BLAST 
software package (2) were used for all DNA and protein sequence analysis. The complete 
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nucleotide sequence of the 1.3 Kb Pstl-StuI DNA fragment of pPS1.3 was determined on 
both strands. The rpoA sequence has been deposited into GenBank (accession # AF1 11855) 
Overexpression and purification of proteins 

For overproduction of the RpoA proteins, rpoA genes from E. coli and A. 
5 tumefaciens were PCR amplified from the MC4100 chromosome and plasmid pBX4. 1 
(Figure 1), respectively, using primers designed to contain a BamHI restriction site. [E. 
colt 1)5'-CCA AAG AGA GGA TCC AAT GCA GGG-3', 2) 5'-CCT TAA CCT GGG 
ATC CGG TTA CTC G-3';A tumefaciens 1) 5'-GGA AGG ATC CAA GAT GAT TCA 
GAA GA-3\ 2) 5'-CCT GGAA TCC TGC AGA TGA CTT ATC TG-3']. The PCR 

10 products were initially cloned into PCR2.1TOPO (Invitrogen, Inc.), and then subcloned 
into pQE vectors (Qiagen) to generate pECH4 and pZL-2, containing N-terminal His- 
tagged fusions to RpoA of E. colt and A. tumefaciens, respectively. 

The His-tagged RpoA proteins of A. tumefaciens and E. coli were prepared from E. 
coli strain SG13009 (pREP4) containing pZL-2 or pECH4, respectively. Cells were grown 

15 at 37°C in 500 ml of L-Broth containing selective antibiotics to an ODeoQ of 0.3, and then 
induced with 1 mM IPTG for 3 hours at 37°C. Cells were harvested, resuspended in 20 ml 
of binding buffer (20 mM Tris-HCl pH7.9, 500 mM NaCl, 5% Glycerol) and lysed by 
sonication. The lysates were cleared by low speed centrifugation and loaded on a Fast 
Flow Chelating Sepharose Column charged with Ni 2+ (Pharmacia, Inc.). The column was 

20 attached to a Waters 650 Advanced Protein Purification System, washed extensively with 
loading buffer containing 25 mM imidazole, and bound protein eluted with buffer 
containing 100 mM imidazole. Fractions containing greater than 90% RpoA were 
precipitated by addition of ammonium sulfate to 75% saturation. The resulting pellet was 
dissolved in a storage buffer (20 mM Tris pH7.9, 40 mM KC1, 10 mM MgCl 2 , 1 mM 

25 DTT, 50% Glycerol) and stored at 20°C. 

Recombinant p P' and a 70 subunits of £. coli RNAP were each purified from 
overexpression strains as described previously (16, 51). Induction was initiated with 1 mM 
IPTG at 37°C for 3 hours and the proteins present in inclusion bodies were then purified 
according to Tang (55). The constitutively active VirG protein (VirG con ) was purified from 

30 E. coli as described previously (30). 

RNAP holoenzymes were reconstituted from individually purified E. coli p p' and 
a 70 , and either E. coli or A. tumefaciens RpoA as described (3). The molar ration of a, p 
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and p* in the reconstitution reactions was 1:4:8. After reconstitution and thermoactivation 
in the presence of a 70 , RNAP preparations were further purified by gel exclusion 
chromatography on a Superose-6 column (Pharmacia) and ion exchange chromatography on 
a Resource Q column (Pharmacia). The purified RNAP holoenzymes were concentrated by 

5 filtration through a C-100 concentrator (Amicon, Inc.) to "1 mg/ml, and stored in 50% 
glycerol storage buffer at -20°C. 
In Vitro Transcription Assays 

For analysis of abortive initiation, 2 pmol of recombinant RNAP holoenzyme 
containing either E. coli or A. tumefaciens RpoA was incubated with 10 pmol of template 

10 DNA in 10 fi\ of transcription buffer (40 mM Tris-HCl pH7.9, 40 mM KC1, 10 mM 

MgCl 2 ) for 10 minutes at 23 °C in the presence or absence of the VirG 00 " protein (2 pmol). 
As templates, a 130 bp EconRI-Hindm fragment of plasmid pAA121 containing the ga/Pl 
promoter (-63 to +44) or a 380 bp PCR product of plasmid pSM243cd (Stachel 1985) 
containing the virB promoter (-248 to +81) were used (4, 10). Abortive initiation 

15 reactions were initiated by addition of 0.5 mM initiating dinucleotide (CpA for ga/Pl 
promoter and ApU for virB promoter) and 0.5 fiM [a- 33 p] UTP (ga/Pl) or [a- 33 p] ATP 
(virB) (3000Ci/mmol). After 25 incubation at 37 °C, reactions were terminated by the 
addition of an equal volume of urea loading buffer. The reaction products were resolved on 
urea-PAGE (20% poly aery lamide, 19:1 acrylamide:bis) and visualized by autoradiography. 

20 Mobility shift assays 

Electrophoretic mobility shift DNA binding assays were carried out using a PCR 
amplified virB promoter labeled with (ct- 32 P] dCTP. PCR reactions included 20 jxCi (a- 
32 P]dCTP, plasmid pSM243cd as template, Primer 1:5'-TTC CAC GGT GAC GCA TCG 
AAT G-3\ and Primer 2:5'-CCC CGA TCT CTT AAA CAT ACC TTA TCT CC-3'. 

25 Unincorporated nucleotides were removed using the Wizard™ PCR Preps DNA 

Purification System kit from Promega. Mobility shift reaction mixtures contained 1600 
cpm 32 P-labeled virB promoter, 50 mM KC1, 20 mM Tris-HCl pH 7.0, 10 mM MgCl 2 , 1 
mM DTT, 10% Glycerol and 20 ug/ml Herring sperm DNA. Where indicated, 1.5 \xM 
VirG was added, which was sufficient to saturate all VirG binding sites on the promoter, 

30 and His-RpoA was added at 150-500 nM final concentration. Reactions were incubated for 
30 minutes at 22°C, loaded onto 6% polyacrtlamide, 10% glycerol vertical slab gels in 
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0.5X TBE buffer (46) and electrophoresed at 20V/cm at 4°C for 2 hours. Following 
electrophoresis the gels were dried and autoradiographed overnight at room temperature. 
Identification of an A. tumefaciens gene that is required for vir gene e xpression in E. coli 
To reconstitute vir gene induction in E. coli, plasmid construct pGP159 containing 
5 vir A and virG under the control of their native promoters and a lacZ reporter gene fused 
. downstream of the virB promoter (virBp::lacZ fusion) was introduced into E. coli strain 
MC4100. The resulting strain failed to activate transcription of the fusion in response to 
acetosyringone (AS). The inability of pGP159 to activate transcription may have been due 
to a lack of expression of the virA/virG genes, which are under the control of their native 

10 promoters. To address this possibility, the plasmid pSL107 was utilized, which contains 
vir A and virG under the control of the lac promoter and virBp.lacZ. Introduction of 
pSL107 into MC4100 also failed to activate expression of the vir gene fusion in the 
presence of AS and IPTG. The observed lack of expression was not due to a lack at 
expression of virA and/or virG as both proteins were present at detectable levels by Western 

15 blot (data not shown). Since VirA and VirG were present, the VirA/ VirG signal 

transduction mechanism may not be functional in E. coli. This possibility was addressed 
through the use of pSY215 which contains a constitutively active virG 00 " under the control 
of the lac promoter and vtrBp::lacZ. The vir<J° n is able to activate expression of the 
virBp::lncZ fusion in A. tumefaciens strains independent of virA and acetosyringone (17, 

20 31, 45). When pSY215 was introduced into MC4100, once again no expression of the 

virBprAacZ fusion was observed. This suggested that additional genes from/i. tumefaciens 
may be required or that E. coli may contain specific repressors of vir gene expression. 

To determine if additional genes from A. tumefaciens were required, a cosmid 
library constructed from chromosomal DNA of a Ti plasmidless strain A136, was 

25 introduced into MC4100(pSY215). Screening of the resulting transformants revealed the 
presence of a clone which produced a light blue color on colonies grown on induction 
media containing X-Gal and IPTG. This cosmid clone, designated pBK2, contains a 25 Kb 
DNA insert. In order to identify the gene residing in pBK2 required for expression of 
virBp::lacZ t overlapping subclones of pBK2 were generated which were introduced into 

30 MC4100(pSY215). Expression of the fusion was detected following introduction of 

pBX4.1, but not pBKS7.0 or pBKE4.8 (Fig. 1 & Table 2). The inability of pBKS7.0 and 
pBKE4.8 to activate expression of the fusion indicated that the 0.55 Kb region between 
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these two subclones is required for expression of the fusion. Given this information, a 1.3 
Kb Stul-PstI DNA fragment from pBX4.1 was isolated and subcloned it into pTZ18R and 
pTZ19R yielding pPS1.3R and. PPSI.3, respectively. This resulted in two constructs in 
which the lac promoter on the vector drives transcription from either end of the fragment. 
5 Expression of the vir fusion was detected only with pPSl .3, indicating the absence of 
promoter elements in the fragment and the responsible gene is in the direction of PstI to 
Srul. Subclone pPS1.3 resulted in the highest level of expression of the fusion, almost 15 
fold higher than pBK2, and 40 fold higher than the vector control. This could be due to 
both increased copy number of the gene and increased gene expression by the strong lac 

10 promoter (Table 2). 

Characterization of the identified gene 

The DNA sequence of the entire 1.3 Kb Stul-PstI fragment was obtained from both 
strands. Analysis of the DNA sequence revealed an open reading frame of 1,008 base 
pairs, in the predicted transcriptional direction, sufficient to encode a polypeptide of 336 

15 amino acids as shown in Figure 2A. Nucleotide and protein searches of GenBlank and 
Swiss-Prot databases indicated a high degree of similarity to rpoA genes, encoding the a 
subunit of RNAP. Comparison with rpoA from E. coli indicated 62. 15% sequence 
similarity and 51 A % sequence identity at the amino acid level as shown in Figure 3. Three 
highly conserved regions can be identified from the distribution of homologous amino acid 

20 residues between E. coli and A. tumefaciens rpoA homologues. One region extends from 
residue 30 to 51 near the N-terminus (20 of 22 identical), with the other two present in the 
C-terminal domain extending from residue 256 to 270 (13 of 15 identical) and from residue 
276 to 315 (30 of 42 identical). A notable difference is the presence of an additional eight 
residues at the C-terminus of the A. tumefaciens RpoA, compared to RpoA of E. coli. A 

25 potential Shine-Dalgamo sequence, GAAGGT, was found extending from -7 to -12 bp 

upstream of the proposed ATG initiation codon of rpoA. Analysis of partial DNA sequence 
obtained from pBKS7.0 and pBKE4.8 indicated the presence of regions upstream and 
downstream of rpoA with a high degree of sequence similarity to rpsK and rplQ, encoding 
Sll and LI 7 ribosomal proteins, respectively, possibly forming an operon structure similar 

30 to E. coli. 

Confirmation of the ORF was achieved by construction of a His-tagged rpoA fusion 
as described above, which yielded a polypeptide of the predicted size (" 37 Kd) that is 
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slightly larger than E. coli RpoA (Fig. 4A). Furthermore, the DNA clone from A. 
tumefaciens was able to complement a temperature sensitive rpoA mutant of £. coli, 
HN317, proving that it does encode a functional homologue of the RNAP a subunit. 
In vitro transcription of the virB promoter 
5 In vitro assays involving the rpoA subclones and a constitutive virG con , in vitro 

transcription assays were performed. Purified individual components of E. coli RNAP, p 
p' and a 70 , were mixed with either His-RpoA of E.coli or A. tumefaciens and high 
molecular weight RNAP complexes were purified by sizing column (see Materials and 
Methods). As shown in Figure 4 A, both RpoA molecules were able to successfully 

10 assemble with the E. coli P P' and a 70 subunits into complete RNAP holoenzymes. When 
tested for in vitro transcription, both RNAP holoenzymes were equally efficient in initiating 
transcription from a a 70 dependent E. coli ga/Pl promoter (Fig. 4B), demonstrating that the 
hybrid RNAP containing RpoA (ot-subunit of RNAP) of A. tumefaciens is a functional 
enzyme. Furthermore, no significant differences were detected in the amount of the 

15 transcript produced in the presence or absence ofvirCF on . When the virB promoter was 
used as a template, the E. coli KNAP could activate low level transcription but no 
difference was evident with or without the VirG"" 1 protein. In contrast, the hybrid RNAP 
containing A. tumefaciens RpoA was able to activate transcription from the virB promoter 
at low levels, and addition of the VirG 00 " increased transcription by 4-5 fold as measured by 

20 quantification of the gel (Fig. 4B). These results confirm the in vivo assays demonstrating 
that only RNAP containing RpoA of A. tumefaiens is able to efficiently initiate transcription 
from the virB gene promoter in a VirG"" 1 dependent manner. The inability of E. coli 
RNAP to efficiently express the vir fusion even in the presence of the VirG 0011 suggests that 
E. coli RpoA can not make the required contacts with the VirG con protein. 

25 Specific interaction between the VirG*™ and RpoA of A. tumefaciens 

Since the C-terminal domain of the E. coli RpoA was known to interact with the 
A+T rich "UP element of certain promoters (Ross 1993), the E. coli and A. tumefaciens 
RpoA proteins were tested to determine if these proteins have different affinities for the 
virB promoter. As shown in Figure 5 (lanes 3, 4 and 5), A. tumefaciens RpoA was able to 

30 shift the mobility of the labeled virB promoter at the highest concentration used (600nM), 
whereas the same concentration of E. coli RpoA did not, suggesting that the RpoA of A. 
tumefaciens has a higher affinity for the virB promoter than RpoA of E. coli. Similar 
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mobility shift assays were carried out to determine if His-RpoA of E. coli and A 
tumefaciens interact differently with VirG 0011 at the virB promoter. Increasing amounts of 
RpoA were used in. combination with a saturating quantity of VirG 0011 for the virB promoter 
(Fig. 5, lane 2) The concentration of VirG con used was determined through separate 
5 mobility shift assays in which increasing amounts of VirG"" 1 resulted in two separate shifts 
in mobility, corresponding to binding of VirG 0011 at the two vir boxes of the virB promoter 
(data not shown). At the three concentrations of E. coli RpoA used, there was no 
additional shift in the mobility of the promoter VirG 0011 complex (Fig. 5 A, lanes 6. 7 and 8). 
However, when RpoA from A. tumefaciens was used, two separate shifts were observed 

10 (Fig 5B, lanes 6, 7 and 8) which suggests specific interactions between VirG con and RpoA 
from A. tumefaciens. These results provide ftirther evidence suggesting that RpoA from A. 
tumefaciens may possess a higher affinity for VirG con compared to RpoA from E. coli. 
Combination of virA/G and rvoA of A. tumefaciens is insufficient to reconstitute 
acetosvringone mediated vir gene induction in E. coli 

15 To determine if the signaling mechanism, resulting in vir gene activation, can be 

reconstituted in E. coli, a wild type virA and virG was used in combination with A. 
tumefaciens rpoA. M04100 harboring two plasmid constructs, one containing a 
virBpr.lacZ fusion as well as wild type virA/virG under their native promoters (pGP159), 
and. the other containing a toe-driven A . tumefaciens rpoA gene (pH098), did not show any 

20 significant increase in p-galactosidase activity in the presence of acetosyringone (data not 
shown). The possibility that lack of expression of vir A and/or virG may account for this 
result was again addressed through the use of pSL107 which contains lac-driven virA/virG 
and virBpr.lacZ. However, it was not possible to obtain significant expression of the fusion 
in MC4100 harboring pSL107 and pH098. Introduction of pH098 into MC4100(pLG2), 

25 containing lac-driven VirG™ and virBp::lacZ, however, resulted in a significant increase in 
(3-galactosidase activity, demonstrating that pH098 is able to produce a functional RpoA 
protein. These results suggest that the signal transduction mechanism of VirA/VirG may 
not be functional in E. coli. 

The present invention discloses that it is possible to obtain the expression of 

30 heterologous genes that previously could not have been expressed in a host, such as E. coli, 
by including a rpoA gene or a portion thereof from the same host genus as the expressed 
gene is obtained The ability to use E. coli as a heterologous system provides investigators 

20 
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with a valuable tool for studying these various processes. The identification of a 
chromosomally encoded A. tumefaciens rpo A gene and the demonstration that it constitutes 
one of the components required for expression of a virBp::lacZ gene fusion in a 
heterologous E. coli background supports the present invention. The rpoA gene of E. coli 
5 has been extensively studied, particularly with regard to interactions with transcriptional 
regulators, and suggests that interaction between VirG and RpoA may be required for 
efficient transcription of virulence genes. 

The inability of pGP159 or pSL107 to activate transcription of the virBp::lacZ 
fusion suggested that either signal transduction between VirA and VirG was not functional, 

10 or that additional gene(s) were required from A. tumefaciens for activation. The use of 
pSY215 containing a virG con mutant allowed us to evaluate vir gene expression in a vir A- 
independent manner, eliminating the need for signal transduction. The lack of expression 
obtained with pSY215 in E. coli, combined with its ability to function in a Ti plasmidless 
A. tumefaciens strain suggested that additional A. tumefaciens gene(s) were required for 

15 expression. The introduction of pPS 1.3 containing lac driven rpo A into MC4100(pST2 15) 
resulted in a significant increase (40 fold) in transcription of the virBp::lacZ fusion 
compared to the control vector. Verification that rpo A is required was obtained through the 
use of subclone pPS1.3R, which did not activate expression of the fusion. This construct is 
identical to pPS1.3, but the direction of transcription of rpo A is opposite to that of the lac 

20 promoter. The observation that A. tumefaciens rpoA was able to complement a temperature 
sensitive rpoA mutant in E. coli demonstrates an ability to function at essential E. coli 
promoters. This is evident from the in vitro transcription assay where the hybrid NAP was 
equally effective as E. coli RNAP in transcribing a a 70 -dependent ga/Pl promoter (Fig. 
4B). While the expression of the virBp::lacZ fusion (Table 1) was significantly increased, 

25 the level of expression was relatively low in comparison to expression in A. tumefaciens 

(31). The relatively low expression of the virBp:;lacZ fusion may have been a consequence 
of the presence of RNAP containing RpoA of E. coli. In order to remove possible 
interference from £. coli RpoA, in vitro transcription assays using reconstituted RNAP 
holoenzymes containing His-RpoA from either £. coli or A. tumefaciens were carried out. 

30 Using purified E. coli p, P' and a 70 subunits, it was demonstrated that both of the His- 

RpoA were able to assemble into multi-subunit RNAP holoenzymes (Fig. 4A). The results 
of the in vitro transcription assays demonstrated that VirG^-dependent transcription of the 
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virBp::lacZ fusion requires RNAP containing A. tumefaciens RpoA, although the two 
reconstituted holoenzymes exhibited essentially identical activity in transcription from the 
ga/Pl promoter, with no significant difference in the presence or absence of VirG con (Fig 
4B). Another possible explanation for the relatively low induction in E. coli may due to the 
5 presence of E. coli sigma factors in the RNAP holoenzymes. It is conceivable that E. coli 
sigma factors have a lower affinity for the virB promoter than sigma factors from A. 
tumefaciens. Although the vegetative sigma factor from A. tumefaciens has been identified 
(50), it is unclear whether this or an alternative sigma factor is involved in vir gene 
transcription. 

10 Previous reports have identified the presence of an "UP element" in certain E. coli 

promoters which is required for optimal transcription (14, 41, 47). This element extends 
from -40 to -60 bp upstream of the transcription start site and is highly A+T rich. 
Interestingly, the virB promoter contains an A+T rich sequence from -40 to -60 that 
overlaps wit the VirG binding sites. Whether this region of the promoter constitutes a true 
15 "UP element" is unknown. From the gel shift assays, the A. tumefaciens RpoA appears to 
have a higher affinity for the virB promoter than E. coli RpoA (Fig 5 A and B), although the 
importance of this observation is unclear at this time. The results of the mobility shift 
assay suggest that E. coil RpoA is unable to bind to VirG™ at the virB promoter. In 
contrasts, A. tumefaciens RpoA appears to exhibit cooperative binding with two distinct 
20 shifts in the mobility of the VirG on -virB promoter complex. Taken together, these results 
indicate that RNAP containing E. coli RpoA may be unable to interact effectively with 
VirG"", and therefore can not activate transcription from the virB promoter. Since the virB 
promoter contains two binding sites for VirG, the presence of two shifts obtained with 
increasing amounts of A. tumefaciens RpoA may be a consequence of RpoA interacting 
25 with VirG at each vir box. 

The two component regulatory system, composed of vir A and vz'rG, is indispensible 
for transcription of virulence genes within A. tumefaciens. However, vir A and vz'rG are 
insufficient to activate transcrition from virulence gene promoters within E. coli cells, 
indicating a requirement for additional A. tumefaciens gene(s). 
30 In examining vir gene expression in E. coli, attempts to reconstitute wild type 

virulence gene expression in E. coli were not successful. The use of vir A and virG under 
the control of the lac promoter means that sufficient levels of vir A and vz'rG should be 
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present for signal transduction to take place. One possible explanation may be that E. coli 
is unable to correctly insert virA into the inner membrane. Alternatively, even, though 
virA may be inserted into the inner membrane correctly, dimerization of virA which is 
required for activity in. A. turnefuciens (43) may be defective. A more likely explanation 
5 may be that additional gene(s) from A. tumefaciens are required for efficient signal 
transduction. An unresolved question is the exact mechanism of sensing of phenolic 
inducers by the virA/virG system. The two possible mechanisms involve direct binding of 
the inducer by virA, or binding by a second receptor which then interacts with virA. 
Although genetic evidence supporting direct binding of inducers by virA has been reported 

10 (36, 37), all attempts to demonstrate direct binding by virA have been unsuccessful. 

Conversely, there have been reports in which binding of phenolic compounds by proteins 
other than virA have been detected (13, 35), although there is no evidence to link these 
proteins with vir gene induction. The search for additional A. tumefaciens gene(s) involved 
in the signal transduction should be simplified by determining that v/rG 00 " mediated 

15 expression of virulence genes requires RpoA from A. tumefaciens. The present invention 
provides the basis to examine vir gene expression as well as the T-DNA transfer process in 
E. coli. 

This specific example shows that for the expression of vir genes in E. coli, both 
rpoA and virG from A tumefaciens are required for transcriptional activation of a vir 

20 promoter in E. coli. It has been determined that the rpoA gene, encoding the a subunit of 
RNAP, confers significant expression of a virBp;;lacZ fusion in E.coli in the presence of an 
active transcriptional regulator virG gene. In vitro transcription assays were conducted 
using either reconstituted E. coli RNAP or hybrid RNAP in which the a subunit was 
derived from A. tumefaciens. Both RNAPs were equally efficient in transcription from a 

25 a 70 - dependent E. coli galPl promoter, however, only the hybrid RNAP was able to 
transcribe the virB promoter in a v/rG-dependent manner. 

Evidence is also presented which indicates that virG interacts with rpoA from A. 
tumefaciens but not with rpoA from E. coli. This observation suggests that in order for 
successful transcription of vir genes to occur, specific interaction(s) between the A. 

30 tumefaciens a subunit of RNAP and virG is required. 

As shown in Figure 3, one of the major difference between the RpoA sequences of 
A. tumefaciens and E.coli is the presence of an extra 8 consecutive amino acids at the C- 
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terminus of A. tumefaciens RpoA (amino acids 329 to 336). To test if these C-terminal 8 
consecutive amino acids are important in mediating vir gene activation in E.coli, a plasmid 
construct was generated from pPS1.3 by deleting the DNA coding for the last 8 amino acids 
using site directed mutagenesis. The resulting plasmid pAD8 encodes 328 amino acid long 
5 A. tumefaciens RpoA, lacking the original amino acids from 329 to 336. 

The vir gene activating ability of this construct was tested by introducing pAD8 into 
E.coli strain MC4100 harboring pSY215 which contains virA, virG and virB::lacZ fusion. 
As negative and positive controls, vector plasmid pQE31 or pPS1.3 which encodes intact 
RpoA of A. tumefaciens was introduced into MC4100(pSY215), respectively. The vir gene 

10 activation was monitored by measuring the p-galactosidase activities. As shown in Table 3, 
pAD8 failed to activate p-galactosidase activity whereas the pPS1.3 induced a high level P- 
galactosidase activity. These results indicate that the C-terminal 8 amino acids of A. 
tumefaciens RpoA are essential for the vir gene activation in E.coli. 

These experiments also demonstrate that incorporation of a rpoA gene of a 

15 prokaryote, such as A.twnefaciens, into an expression vector will promote the expression of 
genes from the prokaryotic cells in other prokaryotic cells, such as E. coli. Expression of 
this expression vector comprising at least a portion of an A. tumefaciens gene is a useful 
gene delivery system in heterologous hosts for gene delivery and gene therapy. Further, 
incorporation of at least a portion of an rpoA gene derived from the same cells as one or 

20 more heterologous genes, into a vector containing the heterologous genes, provide 

improved systems for expressing the heterologous genes. These expression vectors which 
comprise one or more heterologous genes and at least a portion of an rpoA gene derived 
from the same cells as the heterologous genes, are useful in the production of selected 
metabolites, by expression genes for an entire metabolic pathway in a heterologous host. 

25 The following list of publications referred to in the specification are herein 

incorporated in their entirety by reference. 
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Bacterial Strains and plasmids used in this study 



Strain or plasmid 


Genotype or phenotype 


source 


E. coli 






DH5cc 


recA endAI hsdR17 supE4 gyrA96 reLAI (lacZYA- 


(56) 




argF)U169 (80dlacZM15) 




MC4100 


F araD139 (argF-kac)Ul69 rpsL150 relA fib5301 


(57) 




ptsF25deoCl 




HN317 


rpoAm^St 1 


(58) 


Plasmids 






pGP159 


virA. virG & virBp: :lacZ; IncP, Ap\TC 


(59) 


pSM234acd 


virA & virBp: :lacZ: IncP. Km r 


/A C\ 

(45) 


pLG2 


Zac/>::vzrG(N54D) & vzrSp: :lacZ; IncP, ApTc 


lnis study 


pSL107 


: :virG & v/rBp: :focZ:IncW, AFTc 1 


lnis study 


pPC401(N54D) 


lacp: :vz>G(N54D) in pTZ18R.Ap' 


This study 


pSY215 


virflfp: :lacZ & Zacp: :virG(N54D); IncW, Gm r 


This study 


pVK102 


Cosmid clonmg vector; IncP, Km ,Tc 


(60) 


pTZ18R/pTZ19R 


Cloning vector; ColEl.Ap r 


T TOT* 

USB 


pBK2 


pVK102 with 25 Kb chromosomal insert from 


This Study 




A. tumefaciens strain A136, Tc l 




pBKS2-2 


pTZ18R with 7.1 Kb SauiA insert from pBK2 


This study 


pBX4.1 


pTZ18R with 4.3 Kb BamHI-Xbal insert from pBKS2-2 


This study 


pBKS7.0 


pTZ18R with 4.2 Kb Saw3A-Sa/I insert from pBKS2-2 


This study 


pBKE4.8 


pTZ18R with 1.7 Kb EcdRl-Sau3A insert from pBKS2-2 


This study 


pPS1.3R 


pTZ18R with 1.3 Kb Pstl-Stul insert from pBKS2-2 


This study 


pPS1.3 


pTZ19R with 1.3 Kb Pstl-Stul insert from pBKS2-2 


This study 


pZL-2 


Overproducer of A. tumefaciens His-RpoA: Ap l 


This study 


pECH4 


Overproducer of £. coli His-RpoA.Ap 1 


This study 


pH098 


lacp::rpoA of A. tumefaciens from pPS1.3: IncW.Km 1 


This study 
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Table 2. RpoA mediated expression of virBP::IacZ fusion in E. coli MC4100 
containing virG* s{ 

Constructs LB medium MG/L medium 

p-Galactosidase b Fold Activation 0 p-Galactosidase b Fold Activation 



pBK102 


2.46 


NA 


2.35 


NA 


pBK2 


5.2 


2.11 X 


6.56 


2.79 X 


pTZ18R 


2.24 


NA 


2.51 


NA 


pBKS2-2 


13.67 


6.1 X 


7.25 


2.89 X 


pBX4.1 


26.42 


11. 8X 


8.11 


3.23 X 


pBKS7.0 


1.68 


0.75 X 


2.46 


0.98 X 


pBKE4.8 


2.36 


1.05 X 


1.63 


0.65 X 


pBP3.0 


5.45 


2.43 X 


2.83 


1.13 X 


pPS1.3R 


2.13 


0.95 X 


1.71 


0.68 X 


pPS1.3 


76.14 


40.0 X 


37.69 


15.0 X 



The virBpyJacZ expression assays were carried out in LB and MG/L medium for 16 
hours as described in the text, IPTG was added at a final concentration of 1 mM. E. coli 
MC4100 containing pSY215 (v/rCT*" + vz>5p::lacZ). b Values are the average of three 
replicates. 

c Values indicate an increase or decrease in activity compared to the appropriate 
control vector. NA, Not Applicable. 



Table 3. p-galactosidase activities 



E.coli strain MC4100(pSY215) 
plus plasmid shown in right 


pPS1.3 


pAD8 


pQE31 


P-galactosidase in 
Miller Units* 


140(+/-25) 


4(+/-l) 


3 ( + /-2) 



* Miller Units = Defined in Ref. 68. 
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We claim : 

1 . An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium. 

2. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium, 

3 . An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is depicted in 
Figure 2A. 

4. An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is 
depicted in Figure 2A. 

5. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises at least amino acid residues 157 to 336. 

6. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises less than amino acid residues 157 to 336. 

7. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises at least 8 consecutive amino acid residues in 
length. 

8 . The nucleic acid molecule of claim 7, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises amino acid residues 329 to 336. 

9. An isolated nucleic acid molecule comprising the nucleic acid sequence as 
depicted in Figure 2B. 

10. An isolated nucleic acid molecule comprising at least a portion of the nucleic 
acid sequence as depicted in Figure 2B. 

11. A hybrid nucleic acid molecule for the expression of at least one 
heterologous gene in a host cell, wherein said hybrid nucleic acid molecule comprises a 
first nucleic acid sequence encoding at least a portion of the a subunit of an RNAP obtained 
from the same genus as the host cell and a second nucleic acid sequence encoding at least a 
portion of the a subunit of an RNAP obtained from the same genus as the heterologous 
gene. 

12. The isolated nucleic acid molecule of claim 11, wherein said second nucleic 
acid sequence is obtained from a prokaryotic cell selected from the group consisting of 
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Gram-negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic 
bacteria, anaerobic bacteria, and facultative bacteria. 

13 The isolated nucleic acid molecule of claim 12, wherein said second nucleic 
acid sequence is obtained from the genus Agrobacterium, Mycobacterium, Chlamydia, 
Pseudomonas or Streptomyces. 

14. The nucleic acid molecule of claim 13, wherein said second nucleic acid 
sequence is obtained from Agrobacterium. 

15. The nucleic acid molecule of claim 14, wherein said second nucleic acid _ 
sequence encodes the amino acid sequence as depicted in Figure 2A. 

16. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes at least a portion of the amino acid sequence as depicted in Figure 2A. 

17. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises at least amino acid residues 157 to 336. 

18. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises less than amino acid residues 157 to 336. 

19. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A comprises at least 8 consecutive amino acid residues 
in length. 

20. The nucleic acid molecule of claim 19, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A comprises amino acid residues 329 to 336. 

21. The hybrid nucleic acid molecule of claim 11, wherein said first nucleic acid 
sequence is obtained from a prokaryotic cell selected from the group consisting of Gram- 
negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

22. The nucleic acid molecule of claim 21, wherein said first nucleic acid 
sequence is obtained from the genus Escherichia, Pseudomonas, Bacillus, Lactobacillus, 
Clostridium or Rhizobium. 

23. A nucleic acid construct comprising a nucleic acid molecule of any one of 
claims 1-22 operably linked to a promoter. 

24. A vector comprising the nucleic acid construct of claim 23. 
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25. A host cell comprising a nucleic acid construct of claim 23 or a vector of 
claim 24, wherein said promoter is operable in said host cell. 

26. A host cell of claim 25, further comprising at least one heterolgous gene 
operably linked to a promoter. 

27. A host cell of claim 26, wherein said heterolgous gene and promoter are 

contained in a vector. 

28. A host cell of any one of claims 25-27, wherein said host cell is a 
prokaryotic cell or a eukaryotice cell. 

29. A host cell of claim 28, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria v acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

30. A host cell of claim 29, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

31 . A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming said host cell with a nucleic acid molecule of 23 and with at least one 
heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

32. The method of claim 31, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

33. The method of claim 32, wherein said transcirptional regulator is a 
transcriptional activator that interacts with the a subunit of the RNAP to enhance the 
expression of said heterologous gene. 

34. The method of any one of claims 31 - 33, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 

35. The method of claim 31, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

36. The method of claim 31, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 
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37. The method of claim 31, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

38. The method of claim 37, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fact 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

39. The method of claim 38, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

40. The method of claim 39, wherein said prokaryotic cell is an Escherichia. 

41 . The method of claim 3 1 , wherein said genus of the source of the heterolgous 
gene is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, 
Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria. 

42. The method of claim 41, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

43. The method of claim 42, wherein said genus is obtained from 
Agrobacterium. 

44. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming said host cell with a nucleic acid molecule encoding a complete a 
subunit of an RNA polymerase obtained from the same genus as the heterologous gene or a 
portion of said a subunit of the RNA polymerase operably linked to a promoter, and with 
at least one heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

45. The method of claim 44, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

46. The method of claim 45, wherein said transcirptional regulator is a 
transcriptional activator that interacts with said a subunit of the RNAP to enhance the 
expression of said heterologous gene. 

47. The method of any one of claims 44-46, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 
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48. The method of claim 44, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

49. The method of claim 44, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

50. The method of claim 44, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

51. The method of claim 50, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, aicd-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

52. The method of claim 51 , wherein said host cell is from the genus 
Escherichia, Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

53. The method of claim 52, wherein said prokaryotic cell is an Escherichia. 

54. The method of claim 44, wherein said genus of the source of the heterolgous 
gene is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, 
Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria. 

55. The method of claim 54, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

56. A purified a subunit of an RNA polymerase of Agrobacterium or a portion 
thereof. 

57. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
a portion thereof wherein the amino acid sequence of said a subunit is depicted in Figure 
2A or a portion thereof. 

58. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
portion thereof, wherein the a subunit of RNA polymerase is encoded by the nucleic acid is 
depicted in Figure 2B or a portion thereof. 
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AMENDED CLAIMS 

[received by the International Bureau on 15 August 2000 (15.08.00); 
original claims 3-10, 15-20, 28, 34, 57 and 58 amended; 
remaining claims unchanged (6 pages)] 

1. An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium. 

2. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium. 

3. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is depicted in 
Figure 2A (SEQ ID NO: 1). 

4. An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is 
depicted in Figure 2A (SEQ ID NO:l). 

5. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises at least amino acid residues 
157 to 336. 

6. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises less than amino acid residues 
157 to 336. 

7. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises at least 8 consecutive amino 
acid residues in length. 

8. The nucleic acid molecule of claim 7, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises amino acid residues 329 to 
336. 

9. An isolated nucleic acid molecule comprising the nucleic acid sequence as 
depicted in Figure 2B (SEQ ID NO:2). 

10. An isolated nucleic acid molecule comprising at least a portion of the nucleic 
acid sequence as depicted in Figure 2B (SEQ ID NO: 2). 

11. A hybrid nucleic acid molecule for the expression of at least one 
heterologous gene in a host cell, wherein said hybrid nucleic acid molecule comprises a 
first nucleic acid sequence encoding at least a portion of the a subunit of an RNAP obtained 
from the same genus as the host cell and a second nucleic acid sequence encoding at least a 
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portion of the a subunit of an RNAP obtained from the same genus as the heterologous 
gene. 

12. The isolated nucleic acid molecule of claim 11, wherein said second nucleic 
acid sequence is obtained from a prokaryotic cell selected from the group consisting of 
Gram-negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic 
bacteria, anaerobic bacteria, and facultative bacteria. 

13 The isolated nucleic acid molecule of claim 12, wherein said second nucleic 
acid sequence is obtained from the genus Agrobacterium, Mycobacterium, Chlamydia, 
Pseudomonas or Streptomyces . 

14. The nucleic acid molecule of claim 13, wherein said second nucleic acid 
sequence is obtained from Agrobacterium. 

15. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes the amino acid sequence as depicted in Figure 2 A (SEQ ID NO:l). 

16. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes at least a portion of the amino acid sequence as depicted in Figure 2A 
(SEQ ID NO: 1). 

17. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A (SEQ ID NO: 1) comprises at least amino acid 
residues 157 to 336. 

18. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises less than amino acid 
residues 157 to 336. 

19. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in-Figure 2A (SEQ ID-NO:!) -comprises at- least -8 consecutive 
amino acid residues in length. 

20. The nucleic acid molecule of claim 19, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises amino acid residues 329 
to 336. . 

21 . The hybrid nucleic acid molecule of claim 1 1 , wherein said first nucleic acid 
sequence is obtained from a prokaryotic cell selected from the group consisting of Gram- 
negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 

anaerobic bacteria, and facultative bacteria. 
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22. The nucleic acid molecule of claim 21, wherein said first nucleic acid 
sequence is obtained from the genus Escherichia, Pseudomonas , Bacillus, Lactobacillus, 
Clostridium or Rhizobium. 

23 . A nucleic acid construct comprising a nucleic acid molecule of any one of 
claims 1-22 operably linked to a promoter. 

24. A vector comprising the nucleic acid construct of claim 23. 

25. A host cell comprising a nucleic acid construct of claim 23, wherein said 
promoter is operable in said host cell. 

26. A host cell of claim 25, further comprising at least one heterologous gene 
operably linked to a promoter. 

27. A host cell of claim 26, wherein said heterologous gene and promoter are 
contained in a vector. 

28. A host cell of claim 26, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

29. A host cell of claim 28, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

30. A host cell of claim 29, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

31. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming a host cell with a nucleic acid molecule of 23 and with at least one 
heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

32. The method of claim 31, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

33. The method of claim 32, wherein said transcriptional regulator is a 
transcriptional activator that interacts with the a subunit of the RNAP to enhance the 
expression of said heterologous gene. 
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34. The method of claim 31, wherein said heterologous gene comprises multiple 
genes or operons in the same metabolic pathway. 

35. The method of claim 31 , wherein said nucleic acid molecule and at least one 
of said heterologous genes are contained in at least one vector. 

36. The method of claim 31 , wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

37. The method of claim 31 , wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

38. The method of claim 37, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fact 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

39. The method of claim 38, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

40. The method of claim 39, wherein said prokaryotic cell is an Escherichia. 

41 . The method of claim 3 1 , wherein said genus of the source of the 
heterologous gene is a prokaryotic cell selected from the group consisting of Gram-negative 
bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

42. The method of claim 41, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

43. The method of claim 42, wherein said genus is obtained from 
Agrobacterium. 

44. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming said host cell with a nucleic acid molecule encoding a complete a 
subunit of an RNA polymerase obtained from the same genus as the heterologous gene or a 
portion of said a subunit of the RNA polymerase operably linked to a promoter, and with 
at least one heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 
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45. The method of claim 44, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

46. The method of claim 45, wherein said transcriptional regulator is a 
transcriptional activator that interacts with said a subunit of the RNAP to enhance the 
expression of said heterologous gene. 

47. The method of any one of claims 44-46, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 

48. The method of claim 44, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

49. The method of claim 44, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

50. The method of claim 44, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

51 . The method of claim 50, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

52. The method of claim 51, wherein said host cell is from the genus 
Escherichia , Pseudomonas, Bacillus, Lactobacillus , Clostridium or Rhizobium. 

53. The method of claim 52, wherein said prokaryotic cell is an Escherichia, 

54. The method of claim 44, wherein said genus of the source of the 
heterologous gene is a prokaryotic cell selected from the group consisting of Gram-negative 
bacteria. Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

55. The method of claim 54, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

56. A purified a subunit of an RNA polymerase of Agrobacterium or a portion 
thereof. 

57. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
a portion thereof wherein the amino acid sequence of said a subunit is depicted in Figure 

2 A (SEQ ID NO:l) or a portion thereof. 
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58. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
portion thereof, wherein the a subunit of RNA polymerase is encoded by the nucleic acid is 
depicted in Figure 2B (SEQ ID NO: 2) or a portion thereof. 



-42- 

AMENDED SHEET (ARTICLE 19) 



XID: <WO 006341 3A1 J.. > 



WO 00/63413 



PCT/USOO/10014 



Activation of 
virBp::lacZ 



Sa 


B 


St S 


E P 


X Sa 




1 


1 1 




1 1 


1 1 



PBK57.0 j- 



pBKE4.8 |- 



pBX4.1 |- 



PBP3.0 |- 



lacP 

.pPSUR — ► | 1 



lacP 

pPSl.3 | 1 



FIG. 1 

1 / 6 



^ID: <WO_ 006341 3A1J > 



WO 00/63413 



PCT7USOO/10014 



FIG. 2A 

MIQKNWQELIKPNKVEFTSSSRTKATLVAEPLERGFGLTLGNALRRVLLS 50 

SLRGAAVTAVQIDGVLHEFSSIPGVREDVTDIVLNIKEIAIKMDGDDSKR 100 

MWRKQGPGSVTAGDIQTVGDIEILNPDHVICTL.DEGAEIRMEFTVNNG 149 

KGYVPAE— RNRAEDAPIGLIPVDSLYSPVKKVSYKVENTREGQVLDYD 196 

K1JMTIETNGSVSGEDAVAFAARILQDQLGVFVNFDEPQKEAEEESVTEL 246 

AFWALLKKVDELELSVRSANCUO^DNIVYIGDLIQKTEAEMLRTPNFGR 296 

KSLNEIKEVLASMGLHLGMEVPAWPPENIEDLAKRYEDQY 336 
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FIG. 2B 

1 aacacggcat gaagtcgctt gaagtcgaag tttgcggtcc gggttccggt cgtgaatcgg 
61 cacttcgcgc tctgcaggct gccggtttca tgatcacttc cattcgcgac gccgatcccg 
121 cacaacggtt gccgtccgcg caagaagcgc cgcgtctgac gcgaccgtgg ttcggaaatt 
181 ccgcctttcc ttcggtctgg cggaattttc gtgtatctgg cgtgtgcgcg tcgatttcga 
241 tcgacggact tgcgctcaag aacccactga tgaaccactg aattaggttc ctctcggtgt 
301 tttcatgctc ggtccgtcac gattggatgg tggcggcgaa cggaaggttt aaagatgatt 
361 cagaagaact ggcaggaact tatcaagccg aacaaggtcg agttcacctc gtccagccgc 
421 accaaggcaa ctctggttgc cgagccgctg gagcgtggtt tcggtcttac cctcggcaac 
481 gcgctgcgcc gcgttctgtt gtcttctctg cgtggtgccg ctgtaacggc cgtgcagatc 
541 gacggtgtcc tgcacgaatt ctcctccatc cccggcgttc gggaagatgt gacggatatc 
601 gtgctcaaca tcaaggaaat cgccatcaag atggatggtg acgattccaa gcgcatggtc 
661 gtgcgcaagc agggtccggg ttcggtaacc gctggtgaca tccagacggt tggcgacatc 
721 gagatcctga accccgacca cgtgatctgc acgctcgatg aaggcgctga aatccgcatg 
781 gaattcaccg tcaacaacgg caagggttac gtaccggctg agcgcaaccg cgcggaagat 
841 gcccctatcg gcctcattcc ggtggacagc ctctattctc cggtcaagaa agtgtcctac 
901 aaggtggaaa acacccgtga aggtcaggtt ctcgactatg acaagctgat catgacgatc 
961 gagaccaacg gttcggtttc cggcgaagac gccgttgcct tcgccgctcg cattcttcag 
1021 gaccagctgg gcgtcttcgt caacttcgac gagccgcaga aggaagcaga agaagaatcg 
1081 gttactgaac tcgcgttcaa cccggcgctt ctcaagaagg tcgacgagct cgaactgtca 
1141 gttcgttcgg caaactgcct gaagaacgac aacatcgttt atatcggcga cctgatccag 
1201 aagaccgaag ccgaaatgct ccgcacgccg aactttggtc gcaagtcgct gaacgaaatc 
1261 aaggaagttc tcgcttccat gggtctgcac ctcggcatgg aagtgccggc atggccgcct 
1321 gagaacatcg aagatctcgc aaagcgttac gaagaccaat actaacaaac aagaaggcag 
1381 accttaaaga ctgcctttcc ccgtcaaaca gcagataagt catctgcatg tgccaggaaa 
1441 cggcaggcct taaagaaggc acctgcgtag aaggagaata gcaatgcgcc acggtaattc 
1501 aggccgcaag ctcaatagaa ccgccagcca ccgcaaggca atgtttgcca acatggctgc 
1561 ttcgctcatc acccatgagc agatcgtcac cacccttccg aaggcgaagg aaatccgtcc 
1621 gatcgtcgag cgtctcgtga cgctgggcaa gcgcggcgac ctgcacgctc gtcgtcaggc 
1681 gatctcgcag at 



0063413A1 I > 



3 / 6 



WO 00/63413 



PCT/US00/10014 



a turn. MI C KN WQ £ L I X PN KVE JT 3 S S RT KAT L V AEPLERGFGIiTIiGN2^I*RRVIil*S 50 
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AGROBACTERIUM TUMEFACIENS RPOA GENE 

This application is a continuation-in-part of provisional application Serial No. 
60/129,682 filed on April 16, 1999 and a continuation-in-part of provisional application 
Serial No. 60/134,206 filed on May 14, 1999, both of which are incorporated herein in 

10 their entirety by reference. 

This work was supported in part by NSF grant MCB-9722227. The U. S. 
Government may have certain rights to the present invention. 

The present invention relates to nucleic acid molecules and constructs containing 
these nucleic acid molecules and methods of using these constructs to express heterologous 

15 genes in hosts. In particular, the nucleic acid molecules of the present invention comprise a 
DNA sequence encoding at least a portion of the a subunit of the RNA polymerase 
(referred to herein as RNAP) obtained from the same genus source from which the 
heterologous genes were originally isolated. The nucleic acid constructs also can optionally 
comprise, if required for expression of the heterologous genes, at least one gene encoding a 

20 transcriptional regulator, also obtained from the same genus as the source of the 
heterologous genes. 

HArKOROTTND OF THE INVENTION 

Although many host cell systems are well characterized and utilized to express a 
25 number of heterologous genes, there are still many genes that cannot be expressed in such 
host cell systems, as for example, Escherichia coli (£. coli). It is believed that the lack of 
expression in these host cell systems is a result of the promoter sequences of the 
heterologous genes not being recognized by the host RNAP. The present invention has 
solved this problem by showing that this block in expression can be overcome by co- 
30 expression of at least a portion of the rpoA gene product, the gene that encodes the a 

subunit of the RNAP, obtained from the same genus as the source of the heterologous gene 
that is desired to be expressed in the new host cell. The co-expressed a subunit of the 
RNAP from the same genus as the source of the heterologous gene combines with the other 
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subunits of the RNAP, p, p', and a, to form a functional RNAP. Additionally, if 
transciptional regulators, such as transcriptional activators or transcriptional repressors, or 
additional other subunits of the RNAP, such as p, p' or a, are required to obtain expression 
of the heterologous gene, then these additional components also are obtained from the same 
5 genus as the source of the heterologous gene. Since two different rpoA genes are present in 
a single host cell, there will be various combinations of RNAPs present in the host cell (1) 
RNAP containing two a subunits of the same genus as the host (2) RNAP containing two a 
subunits from the same genus source as the heterologous gene, and (3) RNAP containing 
one a subunit from the same genus as the host and one a subunit from the same genus 

10 source as the heterologous gene. A heterologous gene is intended to mean a gene that is 
not from the same source as the host cell. 

A^robacierium tumefaciens is a Gram-negative soil bacterium which is the causative 
agent of Crown Gall disease, affecting primarily dicotyledonous plant species (reviewed in 
18, 62). The pathogen incites production of the characteristic tumor through the transfer of 

15 a piece of DNA (T-DNA) from the Ti (Tumor inducing) plasmid into susceptible plant 
cells, with subsequent integration into the host genome. The T-DNA contains genes that 
direct the biosynthesis of auxin and cytokinin in infected cells (1, 57), resulting in 
uncontrolled cell division leading to production of the characteristic tumor. The T-DNA 
also contains genes for the biosynthesis of unique compounds called opines which the 

20 bacterium can utilize as a carbon and nitrogen source (39). 

Successful transfer of the T-DNA is dependent on the coordinated expression of 
virulence (vir) genes located on the Ti plasmid but separate from the T-DNA. Expression 
of vir genes occurs in response to certain phenolic compounds released from wounded 
plants (54). This expression is augmented by monosaccharides (5, 52), and an acidic pH 

25 (38) which are characteristics of plant wound sites. Expression of vir genes requires virA 
and virG, which are members of the family of two component regulatory systems (60). 
VirA is an inner membrane associated histidine protein kinase which autophosphorylates in 
response to the environmental signals (19, 28). The phosphate moiety is subsequently 
transferred to the aspartate residue of VirG, which in turn activates transcription from 

30 promoters containing a specific 12 base pair sequence called the vir box, present in the 
promoters of all vir genes (29, 44). In addition to virA and virG, other chromosomally 
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encoded genes have been identified in A. tumefaciens that have been shown to modulate 
virulence gene expression either directly or indirectly (12, 15, 20, 61). 

The use of E. coil as a heterologous host in which to study the regulation of A. 
tumefaciens virulence genes and the mechanism of T-DNA transfer constitutes an ideal 

5 model system given the degree of characterization at both the biochemical and genetic level. 
However, all previous attempts to reconstitute vir gene expression in E. coil have not been 
successful. Possible explanations for the lack of vir gene expression include the presence of 
unidentified regulatory genes in A. tumefaciens required for vir induction, and/or that E. 
coli may contain specific repressor(s) of vir gene induction. 

10 A characteristic of vir gene promoters is the absence of a strong -35 sequence (10). 

Dnase I footprinting studies have shown that VirG protects a region extending into where 
the -35 consensus sequence should be (29, 44). It has been suggested that binding of VirG 
may functionally replace the -35 consensus sequence allowing transcription to occur. This 
situation is similar to Class II CAP-dependent promoters, in which the CAP binding site 

15 overlaps with the -35 sequence. Studies have demonstrated that transcription at Class II 
CAP-dependent promoters requires interaction between CAP and the a subunit (rpoA) of 
RNAP (42, 49, 64). Many transcriptional factors from E. coli are known to require 
interaction with the a subunit of RNAP , including FNR (59), GaiR (9), MarA (24), Mer R 
(7), MetR (23), OxyR (56), OmpR (21), Rob (26), SoxS (25), and TyrR (34). Analysis of 

20 the a subunit from E. coli indicates the presence of two independent domains, the N- 

terminal domain and the C-terminal domain (21, 27, 63). The N-terminal domain (NTD) is 
involved in the assembly of the core polymerase, while the C-terminal domain (CTD) is 
involved in interaction with certain transcriptional regulators (7, 9, 21, 23, 24, 25, 26, 32, 
34, 47, 56, 59). Recently, interaction between the NTD and CAP at Class II CAP- 

25 dependent promoters has been demonstrated (42,49). 

The present methods of expressing heterologous genes are particularly useful to 
express multiple genes or operons in a metabolic pathway by co-expressing the rpoA gene 
product obtained from the same natural hosts from which the genes or operons were 
originally isolated with the multiple genes. The present method provides an advantage over 

30 existing multiple gene expression systems by eliminating the need to separately isolate each 
gene in the metabolic pathyway and link each gene to a promoter that is functional in the 
host. Expression of heterologous genes for an entire metabolic pathway in hosts is 

3 
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particularly useful to produce particular metabolites, such as pharmaceuticals, food 
supplements, chemical products or fine chemical products. 

The present invention further provides nucleic acid molecules and methods whereby 
heterologous genes encoding metabolites in an entire metabolic pathway can be introduced 

5 into hosts so that these hosts possess specific catabolic and/or metabolic characteristics that 
allow the hosts to grow or express gene products under specific environmental conditions. 
These hosts are useful in bioremediation, bioassays and biocontrol studies where 
manipulation of natural or symbiotic bacterial flora is desired, such as in the control of 
parasites or insects where bacterial symbiants are involved. 

10 The present invention broadly relates to the expression of heterologous genes in a 

host using nucleic acid molecules comprising a gene encoding the complete a subunit of the 
RNAP obtained from the same genus as the source of the heterologous genes or at least a 
portion of the a subunit of the RNAP. The host and the heterologous genes to be expressed 
may be prokaryotic or eukaryotic in origin, however, prokaryotic hosts are preferred. The 

15 present invention also relates to a hybrid nucleic acid molecule encoding a hybrid a subunit 
of RNAP, where a first nucleic acid comprises at least a portion of the rpoA gene encoding 
the a subunit of an RNAP obtained from the same genus as the source of the host cell and a 
second nucleic acid comprises at least a portion of the a subunit of an RNAP obtained from 
the same genus as the heterologous gene. 

20 The present invention specifically discloses the rpoA gene from Agrobacterium 

encoding the a subunit of the RNAP and the corresponding amino acid sequence. Portions 
of both the rpoA gene and the a subunit of RNAP of Agrobacterium also are encompassed 
by the present invention, particularly the sequences disclosed in Figs. 2A and 2B. The 
rpoA gene is useful in methods of expressing Agrobacterium genes in hosts, particularly 

25 prokaryotic host cells. Prior to the present invention, it was not possible to express 

Agrobacterium genes, particularly Agrobacterium virulence genes, in E. coli. Therefore, 
the present invention provides a method for studying Agrobacterium genes and then- 
regulation. 

The invention also relates to genes encoding hybrid rpoA genes that comprise at 
30 least a portion of an rpoA gene from the same genus as a heterologous gene which is 
expressed, such as the Agrobacterium rpoA gene, with the remainder of the rpoA gene 
obtained from the same genus as the host cell in which the heterologous gene is expressed. 
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CTTMMAWY OF THT? INVENTION 

The present invention is directed to an isolated nucleic acid molecule encoding the 
complete a subunit of an RNAP of Agrobacterium or at least a portion thereof. 
Specifically, the present invention is directed to an isolated nucleic acid molecule encodmg 
the complete a subunit of an RNAP of Agrobacterium, in which the amino acid sequence of 
the a subunit is depicted in Figure 2A or at least a portion of this sequence. More 
specifically, the present invention is directed to an isolated nucleic acid molecule 
comprismg the complete nucleic acid sequence as depicted in Figure 2B or at least a pomon 
thereof These isolated nucleic acid molecules encoding at least a portion of the a subumt 
of an RNAP of Agrobacterium also are optionally linked to a promoter that is functional m 
the host in which these molecules are expressed. Further, the present invention is dtrected 
to vectors containing these nucleic acid sequences or molecules. The vectors are useful m 
transforming host cells for the expression of heterologous gene(s). The invention is also 
directed to host cells that are transformed with the nucleic acid molecules described herem. 
The nucleic acid molecules useful to transform the host cells are operably linked to a 
promoter and encode an a subunit of an RNAP or at least one heterologous protem. 

The present invention is also directed to a hybrid nucleic acid molecule for the 
expression of at least one heterologous gene in a host cell comprising a first nucleic acid 
sequence encoding at least a portion of the a subunit of an RNAP obtained from the same 
genus as the host cell and a second nucleic acid sequence encoding at least a portion of the 
a subumt of an RNAP obtained from the same genus as the source of the heterologous 
gene- and vectors containing the hybrid nucleic acid molecule. 

The present invention is further directed to a method of expressing at least one 
heterologous gene in a host cell comprising transforming a host cell with a vector 
comprising a nucleic acid molecule encoding a complete a subunit of RNAP from the same 
genus as the source of the heterologous gene or at least a portion thereof, and also 
transforming the host cell with a vector comprising at least one heterolgous gene. The 
nucleic acid molecule and the heterologous gene(s) also may be contained in one vector but 
preferably these sequences, are contained within two or more vectors. In a further 
30 embodiment, the present invention is also directed to a method of expressing at least one 
heterologous gene in a host cell comprising transforming a host cell with a vector 
comprising a hybrid nucleic acid molecule comprising a first nucleic acid sequence 



25 
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encoding at least a portion of the a subunit of an RNAP obtained from the same genus as 
the host cell and a second nucleic acid sequence encoding at least a portion of the a subunit 
of an RNAP obtained from the same genus as the source of the heterologous gene, and 
transforming the host cell with a vector comprising at least one heterologous gene; and 

5 culturing the transformed host cell under conditions where the heterologous gene is 
expressed in the host cell. The host cell also may be transformed with a single vector 
containing the nucleic acid molecule encoding the complete a subunit of the RNAP, a 
portion thereof or a hybrid molecule comprising a portion of the a subunit of the RNAP, 
and with the heterolgous gene(s) or a subunit of the RNAP peferably, two or more vectors 

10 may be used. The nucleic acid molecules encoding the a subunit and the heterologous 
gene(s) may be in separate vectors for transformation of the host cell. The host cell can 
optionally be transformed with at least one gene encoding a heterologous transcriptional 
regulator obtained from the same genus as the source of the heterologous gene. The 
heterologous gene may comprise multiple genes or operons in the same metabolic pathway. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 . Physical and genetic map of recombinant plasmids carrying the rpoA 
gene from A. tumefaciens is depicted. Abbreviations: B, BamHI; E. FcoRI; P, PstI; Sa, 
Sm3A; St, Sttd, X, Xhol Location and direction of rpoA and lac promoter for constructs 
20 pPS1.3R and pPS1.3 are indicated by arrows. 

Figure 2A. The amino acid sequence of the A. tumefaciens a subunit of the RNAP 

is depicted. 

Figure 2B. The A. tumefaciens rpoA gene encoding the A. tumefaciens a subunit 
of the RNAP is depicted. The actual coding sequence for the a subunit is located between 
25 nucleotide 355 (atg) and nucleotide 1356 (taa). 

Figure 3. Amino acid sequence alignment of the A. tumefaciens and E. coli rpoA 
gene products. Identical amino acid residues at a given position are marked by an asterisk, 
and conserved substitutions are marked by a dot. Gaps introduced to allow optimal 
alignment are designated by a dash. The bolded nucleotides show three highly conserved 
30 regions between the two amino acid sequences. 

Figure 4 A and B. Assembly of His-rpoA into RNAP, and in vitro transcription 
assays. Figure 4A - SDS-PAGE analysis of reconstituted RNAP containing E. coli p, £*, 

6 
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c » subuuirs and His-RpoA from E. coli or A. mmefaciens and sated with Coomassie 
briUiant blueB. Figure 4B - In vitro transcription from WrB and ga Un promoters m Ore 
absence and presence of VirG"". The figure shows rhe resnlr of a multiround ,ranscnpr,on 
region employing reconstituted RNAP containing His-rpoA from either E. col, or A. 
j .umefadens. Where indicamd. 2 pmolofVirG°» was added to the reaction. 

Figures Prorein-protein and protein-DNA interactions berween RpoA. VtrG and 
me virS promoter. The figures show eiectrophoretic mobility shift assays of [^-labeled 
*B promoter conuimng purified VirG™ and His-RpoA from E. coli in Figure 5A or A. 
.umefaciens in Figure 5B Lane 1: Promoter only. Lane 2: Promoter + VirG», Lane 3: 
,0 Promoter + 150 nM RpoA, Lane 4: Promoter + 300 nM RpoA, Lane 5: Promoter + 
600 nM RpoA, Lane 6: Promoter + VirG"" + 150 nM RpoA, Lane 7: Promoter + 
VirG" + 300 nM RpoA, Lane 8: Promoter + VirG"- + 600 nM RpoA. 

tvct a tt ED THRIFT™" OF THE PREFFFFFP EMBODIMENT 

Although many well characterized host cells, such as E. coli, provide advantages as 
an expression host cell, it has not been possible to express the genes of many bacteria in 
these host cells because the promoter sequences of the heterologous genes are not 
recognized by the host cell RNAP. The present invention provides a method of 
overcoming this block to expression by co-expressing the a subunit of the RNAP from the 
20 same genus as the source of the heterologous genes that are to be expressed in the host.and 
optionally co-expressing the transcriptional activator from the same genus as the source of 
the heterologous genes. The success of this method has been demonstrated by showing the 
expression of the Agrobacterium tumefaciens virulence (vir) gene operons in E. coli. In 
this specific example, expression of these genes was shown to require not only the vtr gene 
25 specific transcriptional regulator but also the rpoA gene from A. tumefaciens. The A. 
■ tumefaciens a subunit of the RNAP is able to assemble with the other subunits (p, P ' and o 
subunits) of the E. coli RNAP to form a functional polymerase which specifically 
recognizes the Agrobacterium genes in the presence of the Agrobacterium transcriptional 
regulator, specifically an activator. The results show that this interaction is genus specific, 
30 in that, the transcriptional activator from Agrobacterium can only efficiently interact with 
the rpoA gene product from Agrobacterium and not with that of E. coli. 
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Conventional understanding of gene expression in prokaryotes emphasizes the 
importance of the transcriptional regulator, either an activator or a repressor. A 
transcriptional activator is a protein that is needed to turn on the expression of target genes 
either directly or indirectly whereas repressors are proteins that inhibit the transcription of 

5 target genes either directly or indirectly. In the present invention, it is shown that the a 
subunit of the RNAP is the most critical factor in activating bacterial genes, especially 
those requiring transcriptional regulators. For example, to express heterologous genes or 
operons from organism A in organism B, at least a portion of the rpoA gene from organism 
A is also needed to be co-expressed in organism B. Likewise if transcriptional regulators 

10 are required for expression of heterologous genes, the genes encoding the transcriptional 
regulator(s) from organism A should also be used to transform the host because these 
regulators can most efficiently interact with RpoA from the same organism A but not as 
well or at all with that of organism B. 

The present method is particularly advantageous when it is used to express 

15 particular heterologous metabolic pathway genes in hosts for bioremediation/biocontrol 
purposes or production of particular metabolites. This approach becomes much more 
practical than fusing each gene or operon under the control of a known promoter that is 
functional in the host. 

Expression of genes for an entire metabolic pathway in heterologous hosts is useful 

20 to produce particular metabolites or to confer to the host specific catabolic or metabolic 
characteristics that now enables the host to grow or express gene products under specific 
environmental conditions. 

An example of the utility of the present method is shown by the production of a 
plant growth promoting molecule, indole-3-acetic acid, from Azospirillum brasilense, that 

25 requires at least 3 biosynthetic genes. A. brasilense also possesses multiple genes that 

encode products that fix nitrogen (65). The multiple genes responsible for encoding these 
products and the rpoA gene are introduced via one or more vehicles, such as plasmids or 
cosmids, into a host that is applied to a plant and that expresses the products of the multiple 
genes. The expressed plant growth promoting molecules and nitrogen fixing molecules 

30 affect the growth of the treated plant. 

Further Rhizobium, Brady rhizobium, Azorhiobium and Sinorhizobium are other 
examples of nitrogen-fixing bacteria, each involving at least two dozen genes for nodulation 
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(sy n,biosis) and multiple genes for nitrogen fixation. These bacreria. as we,l as otter 
IT^rta « produce useful produc* for plants, may also be the sonree of the heterologous 

geoes — - - - — • — genc produc,<s> ta a ^ 

pathway that degrades or brea.cs down toxic or environmentally ht*ardous products. For 
ample, a species strain ADP expresses at .east font genes iba. are tnvolve 

^degradation o, a berbicide, Atrazine (66). Tbese genes are tnrrodnced and expressed 
a bos. by appiying dte bos, to areas wbere ibis berbicide is present, renting in the 

degradation of the herbicide on site. 

Fnrtber, the present invention is useful in providing a biocontrol system that 
tacl „des a bos, that expresses heteroiogous gene products, that for examp.e, m disease 
producing organisms on a p.an, animai, human or a surface that wu, support growth , . dte 
Lt rL*~~* Cloroap^s strain 3W expresses at leas, three andfunga, metabobtes, 
p hen azine-l-curbo*yla.e, 2-hydroxyphenazine-l-carboxryUc acid and 2-hydroxyphenaz.ne 
L involves four biosyndtedc genes and 2 regmatory genes (67). These genes may be 
reduced into a pian. bacteria, such as « can be applied ,o a piant to contro, 

the growth of fungi on the plant. 

The present invention also is useful as an A. .un^aciens gene delivery system tn 
hetero,ogous hosts for po.en.ial gene delivery or gene therapy systems of A. 

20 genes to various hosts. 

The present invention also can be used to express heterologous genes in A. 
salens to aher its characteristics (e.g. bos, specificity) for a broader application of the 
A tumefaciens mediated gene delivery system. 

The present invention also is useful to prepare host cells containing an rpoA gene 
25 that is heterologous to the bos, cel., such as Escherichia. This host cell is useful for 
studying changes in gene expression patterns in this cell. 

The present invention can additionally be used to express particular sets of 
heterologous genes, including muhigene operons/stimu.ons or genes encoding multi-subnnt, 
protein complexes, such as proteosomes or gene libraries in hosts as targets for drug 

30 screening and/or diagnostics. 

The present invention discloses an isolated nucleic acid molecule encoding the 
complete a subunit of an RNAP of Agrobacterium or at least a portion thereof. More 



15 



CXSID' <WO 0063A13A1_IA> 



WQ00/634J3 PCT/USOO/10014 

specifically, an isolated nucleic acid molecule encoding the complete a subunit of an RNAP 
of Agrobacterium, where the amino acid sequence of the a subunit is depicted in Figure 2A 
is disclosed or at least a portion thereof. The portion can be a length of amino acid 
sequences that is sufficient to express any heterologous gene in the host cell into which it is 
5 introduced. The portion of the a subunit of the RNAP also should be of a sufficient length 
to provide at least one of the following functions: 1) bind to DNA in the promoter region 
2) interacts with the other subunits of RNAP; i.e., p, P' and a to form a functional RNAP, 
and 3) interacts with the transcriptional activator. More specifically, an isolated nucleic 
acid molecule comprising at least a portion of the nucleic acid sequence as depicted in 

10 Figure 2B and that encodes at least a portion of the amino acid sequence of Figure 2A is 
disclosed. The nucleic acid molecule (rpoA) encoding the Agrobacterium a subunit of the 
RNAP or a portion of the nucleic acid molecule is useful in the expression of 
Agrobacterium genes in any prokaryotic host cells. 

The present invention is also directed to a nucleic acid molecule encoding a hybrid 

15 a subunit of RNAP where the isolated nucleic acid molecule is inserted in a vector and is 
useful in expressing at least one heterologous gene in a host cell comprising a first nucleic 
acid sequence encoding at least a portion of the a subunit of an RNAP obtained from the 
same genus as the host cell and a second nucleic acid sequence encoding at least a portion 
of the a subunit of an RNAP obtained from the same genus as the source of the 

20 heterologous gene. This isolated nucleic acid molecule is operably linked to a promoter so 
that it is under the control of a promoter that is functional in the host cell. For example, 
the E. coli lac or trp promoters are useful to express the rpoA gene in an E. coli host cell. 
Promoters that function in numerous host cells are well known to persons skilled in the art 
and can be selected based upon the host cell selected. The second nucleic acid sequence is 

25 obtained from any eukaryotic or prokaryotic host cell, and more particularly from a 
prokaryotic cell, including Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. More 
specifically, the bacteria are selected from the genus Agrobacterium, Mycobacterium, 
Chlamydia, Pseudomonas or Streptomyces . More preferably, it is obtained from 

30 Agrobacterium, and most preferably the second nucleic acid sequence encodes the complete 
amino acid sequence as depicted in Figure 2A or a portion thereof, with the sufficient 
length of this portion discussed above. The portion of the of the rpoA gene can be from 
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any position in the gene; i.e., that is from the 5' or the 3' end of the gene, which encodes 
the N-terminus and C-terminus of the cx-subunit, respectively. Most preferably, the nucleic 
acid molecule portion encodes at least amino acid residues 157 to 336 as depicted in Figure 
2A. The nucleic acid molecule can encode as few as 8 consecutive amino acid residues of 
5 any portion of Figure 2A. Specifically, a portion comprising at least the amino acid 
residues 157 to 336, or of less than amino acid residues of 157 to 336, and more 
specifically amino acid residues 329 to 336 are useful in the present invention. The first 
nucleic acid sequence is obtained from any bacteria but more preferably, from the genus 
Escherichia, Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

10 The present invention is directed to a method of expressing at least one heterologous 

gene in a host cell comprising transforming a host cell with a nucleic acid construct 
comprising a first nucleic acid sequence encoding at least a portion of the a subunit of an 
RNAP obtained from the same genus as the source of the host cell, and a second nucleic 
acid sequence encoding at least a portion of the a subunit of an RNAP obtained from the 

15 same genus as the source of the heterologous gene; and with at least one heterologous gene; 
and culturing the transformed host cell under conditions wherein the heterologous gene is 
expressed in the host cell. The method further comprising transforming the host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. The heterologous gene may comprise 

20 multiple genes or operons in the same metabolic pathway that catalyze the production of a 
product. 

The host cells useful in the present method are prokaryotic cells or eukaryotic cells. 
Preferably, the host cell is selected from a prokaryotic cell selected from the group 
consisting of Gram-negative bacteria, Gram-positive bacteria, aerobic bacteria, acid-fast 

25 bacteria, mycoplasma, anaerobic bacteria, and facultative bacteria. Preferably, the host cell 
is obtained from the genus Escherichia, Pseudomonas, Bacillus, Lactobacillus, Clostridium 
or Rhizobium. Eukaryotic cells may be selected from known mammalian cells that can be 
grown in cell culture, such as CHO, BHK, COS. 

The present invention also is directed to a method of expressing at least one 

30 heterologous gene in a host cell comprising transforming the host cell with a vector 

comprising a nucleic acid molecule encoding a complete a subunit of an RNA polymerase 
obtained from the same genus as the heterologous gene or a portion of the a subunit of the 

11 
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RNA polymerase, and also transforming the host cell with at least one vector comprising at 
least one heterologous gene; and then culturing the transformed host cell under conditions 
where at least one of the heterologous genes is expressed in the host cell. The method 
further comprises transforming the host cell with at least one gene encoding a heterologous 

5 transcriptional regulator obtained from the same genus as the source of the heterologous 
gene. The transcirptional regulator is a transcriptional activator that interacts with the a 
subunit of the RNAP to enhance the expression of the heterologous gene. The heterologous 
gene may comprise multiple genes or operons in the same metabolic pathway. 

Preferably, the host cell is transformed with two vectors, one containing the rpoA 

10 gene or a portion thereof and one containing at least one heterologous gene. However, the 
host cell may be transformed with a single vector containing the rpoA and heterologous 
genes, but more than two vectors can be used to introduce the genes into the host cell 
depending on the number of genes and the size of the nucleic acid to be introduced. 

The host cell may be a prokaryotic cell or a eukaryotic cell. Preferably the host cell 

15 is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, Gram- 
positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and 
facultative bacteria. More preferably the host cell is from the genus Escherichia, 
Pseudomonas, Bacillus , Lactobacillus, Clostridium or Rhizobium. Most preferably the host 
cell is an Escherichia. 

20 The method is useful for the expression of at least one heterologous gene, where the 

genus of the source of the heterolgous gene is a prokaryotic cell selected from the group 
consisting of Gram-negative bacteria, Gram-positive bacteria, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria; More preferably the genus of the source of the 
heterologous gene is Agrobacterium, Mycobacterium, Chlamydia, Pseudomonas or 

25 Streptomyces , and most preferably, the genus is obtained from Agrobacterium. 

The method utilizes a nucleic acid molecule that encodes the a subunit of the RNAP 
the amino acid sequence as depicted in Figure 2A or at least a portion of the amino acid 
sequence depicted in Figure 2A. The portion of the amino acid sequence useful in the 
present method may be from the N-terminus or C-terminus of the a subunit of the RNAP 

30 but preferably comprises at least amino acid residues 157 to 336 of the amino acid sequence 
depicted in Figure 2A or the portion of the amino acid sequence may comprise less than 
amino acid residues 157 to 336 of the amino acid sequence depicted in Figure 2A. The 
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portion of the amino acid sequence should comprises at least 8 consecutive amino acid 
residues of Figure 2A. More specifically, me portion of me amino acid sequence as 
depicted in Figure 2A comprises amino acid residues 329 to 336. 

The present invention also encompases a purified a subunit of an RNAP of 
5 Agrobacterium. More specifically, the purified a subunit of an RNAP of Agrobacterium of 
the present invention is the amino acid sequence depicted in Figure 2A. The invention is 
also intended to encompass portions of the purified a subunit of an RNAP disclosed in 
Figure 2A. More specifically, the purified a subunit of an RN A polymerase of 
Agrobacterium or a portion thereof is encoded by the nucleic acid is depicted in Figure 2B 

10 or a portion thereof 

A rpoA gene from the same genus as the source of a heterologous gene encoding the 
a subunit from this source is obtained by methods well known to persons skilled in the art. 
For example, this gene can be obtained: (1) by complementation of the a temperature 
sensitive rpoA mutant strain of a known organism, i.e. E.coli as discussed below in the 

15 preferred embodiment; (2) by Southern hybridization based on DNA sequence homology; 
(3) by Western blot based on cross reactivity of an antibody against a known RNAP a 
subunit; and (4) by PCR amplification based on conserved DNA sequences among known 
rpoA genes. For their expression, the rpoA genes are fused behind or operably linked to a 
promoter that is known to be operable in the host. The fusion is either a transcriptional 

20 fusion or translation^ fusion. Many standard methods are well known to persons skilled in 
the art and can be utilized to carryout these methods (See reference 68 which is herein 
incorporated in its entirety by reference.) 

The following specific examples set forth below serve to further illustrate the 
present invention in its preferred embodiments, and are not intended to limit the present 

25 invention to these examples. 

EXAMPLES 

Bacterial st rains, plasm as and media. 

All strains "and plasmids used or constructed are listed in Table 1. Bacterial strains 
30 were grown in either LB medium (40), Mannitol Glutamate Luria salts (MG/L) medium 
(58), or Induction medium containing 1 % glucose (61) at 28°C. Induction medium was 
used for attempts to reconstitute wild type vir gene induction, while MG/L and LB medium 

13 
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were utilized for strains containing virG*"" (i.e. pSY215 and pLG2). When appropriate, 
media was supplemented with ampicillin (100 ^g/ml), gentamycin (20 ug/ml), kanamycin 
(50 |ig/ml) and tetracycline (20 ug/ml) for E. coli and carbenicillin (100 ug/mi), 
gentamycin (100 ug/ml), kanamycin (100 ug/ml) and tetracycline (5 ug/ml) for A. 

5 tumefaciens. For determinations of p-galactosidase activity, 5-bromo-4-chloro-3-indoly-p- 
D-galactopyranoside (X-Gal) and isopropylthio-P-galactoside (IPIG) were included at a 
final concentration of 75 ug/ml and 1 mM, respectively. Acetosyringone (Sigma) and 
glucose was included, when necessary at 200 uM and 1 %, respectively. For induction 
assays, E. coli stain MC4100 containing constructs indicated in Table 1 were grown to 

10 stationary phase in 5 ml of appropriate medium, containing antibiotics as required. These 
overnight cultures were used to inoculate 125 ml flasks containing 30 ml of identical 
medium using 0.5 ml as an inoculum. The cultures were incubated at 28°C with shaking 
for 16 hours and assayed for p-galactosidase activity according to the method of Miller 
(40). For reconstitution of virulence gene expression in E. coil, plasmid constructs 

15 pSY204. pLG2, pGP159 and pSL107 were introduced into MC4100 by electroporation. 
Construct pH098 containing lac-driven A. tumefaciens rpoA was then introduced into these 
strains and initially screened on LB medium (pSY204 and pLG2) or induction medium 
(pGP159and pSL107) containing appropriate antibiotics, 1 mM IPTG, 75 mg/mi X-Gal and 
200 uM acetosyringone. Induction assays for these strains were carried out as described 

20 above. 

Isolation and subcloning of rvoA locus 

A cosmid clone library of A. tumefaciens strain A136, a derivative of strain C58 
lacking the Ti plasmid, in pVK102 was described previously (8). The cosmid clones were 
transformed into E. coli strain MC4100 containing plasmid pSY215. Transformants were 

25 initially screened for the development of a blue color on induction media containing X-Gal, 
indicating expression of the virBp::lacZ fusion. A cosmid clone, designated pBK2, was 
isolated by this procedure and was used for subsequent subcloning attempts, as outlined in 
Figure 1. 

DNA sequencing was performed by PCR mediated Taq DyeDeoxy Terminator 
30 Cycle sequence on an Applied Biosy stems model 377 DNA sequencer. The GCG sequence 
analysis software package (Genetics Computer Group, Madison, WI) and the BLAST 
software package (2) were used for all DNA and protein sequence analysis. The complete 
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nucleotide sequence of the 1.3 Kb Pstl-StuI DNA fragment of pPS1.3 was determined on 
both strands. The rpoA sequence has been deposited into GenBank (accession # AF1 1 1855) 
Overexpression and purification o f proteins 

For overproduction of the RpoA proteins, rpoA genes from£. coli and/4. 

5 tumefaciens were PCR amplified from the MC4100 chromosome and plasmid pBX4. 1 
(Figure 1), respectively, using primers designed to contain a BamHI restriction site. [E. 
colt 1)5' -CCA AAG AGA GGA TCC AAT GCA GGG-3', 2) 5'-CCT TAA CCT GGG 
ATC CGG TTA CTC G-3'; A. tumefaciens 1) 5 '-GGA AGG ATC CAA GAT GAT TCA 
GAA GA-3\ 2) 5' -CCT GGAA TCC TGC AGA TGA CTT ATC TG-3']. The PCR 

10 products were initially cloned into PCR2. 1TOPO (Invitrogen, Inc.), and then subcloned 
into pQE vectors (Qiagen) to generate pECH4 and pZL-2, containing N-terminal His- 
tagged fusions to RpoA of E. colt and A. tumefaciens, respectively. 

The His-tagged RpoA proteins of A. tumefaciens and E. coli were prepared from E. 
coli strain SG13009 (pREP4) containing pZL-2 or pECH4, respectively. Cells were grown 

15 at 37 °C in 500 ml of L-Broth containing selective antibiotics to an OD^ of 0.3, and then 
induced with 1 mM IPTG for 3 hours at 37'C. Cells were harvested, resuspended in 20 ml 
of binding buffer (20 mM Tris-HCl pH7.9, 500 mM NaCl, 5% Glycerol) and lysed by 
sonication. The lysates were cleared by low speed centrifugation and loaded on a Fast 
Flow Chelating Sepharose Column charged with Ni 2+ (Pharmacia, Inc.). The column was 

20 attached to a Waters 650 Advanced Protein Purification System, washed extensively with 
loading buffer containing 25 mM imidazole, and bound protein eluted with buffer 
containing 100 mM imidazole. Fractions containing greater than 90% RpoA were 
precipitated by addition of ammonium sulfate to 75% saturation. The resulting pellet was 
dissolved in a storage buffer (20 mM Tris pH7.9, 40 mM KC1, 10 mM MgCl 2 , 1 mM 

25 DTT, 50% Glycerol) and stored at 20°C. 

Recombinant p p' and a 70 subunits of E. coli RNAP were each purified from 
overexpression strains as described previously (16, 51). Induction was initiated with 1 mM 
IPTG at 37°C for 3 hours and the proteins present in inclusion bodies were then purified 
according to Tang (55). The constitutively active VirG protein (VirG con ) was purified from 

30 E. coli as described previously (30). 

RNAP holoenzymes were reconstituted from individually purified E. coli p P' and 
a 70 , and either E. coli or A. tumefaciens RpoA as described (3). The molar ration of a, p 
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and p" in the reconstitution reactions was 1:4:8. After reconstitution and thermoactivation 
in the presence of a 70 , RNAP preparations were further purified by gel exclusion 
chromatography on a Superose-6 column (Pharmacia) and ion exchange chromatography on 
a Resource Q column (Pharmacia). The purified RNAP holoenzymes were concentrated by 
filtration through a C-100 concentrator (Amicon, Inc.) to "1 mg/ml, and stored in 50% 
glycerol storage buffer at -20°C. 
Tn Vitro Transcription Assays 

For analysis of abortive initiation, 2 pmol of recombinant RNAP holoenzyme 
containing either E. coli or A. tumefaciens RpoA was incubated with 10 pmol of template 
DNA in 10 fx\ of transcription buffer (40 mM Tris-HCl pH7.9, 40 mM KC1, 10 mM 
MgCl,) for 10 minutes at 23 °C in the presence or absence Of the VirG" 3 " protein (2 pmol). 
As templates, a 130 bp EconRI-Hindin fragment of plasmid pAA121 containing the ga/Pl 
promoter (-63 to +44) or a 380 bp PCR product of plasmid pSM243cd (Stachel 1985) 
containing the virB promoter (-248 to +81) were used (4, 10). Abortive initiation 
reactions were initiated by addition of 0.5 mM initiating dinucleotide (CpA for ga/Pl 
promoter and ApU for virB promoter) and 0.5 pM [a- 33 p] UTP (ga/Pl) or [a- 33 p] ATP 
(virB) (3000Ci/mmol). After 25 incubation at 37°C, reactions were terminated by the 
addition of an equal volume of urea loading buffer. The reaction products were resolved on 
urea-PAGE (20% polyacrylamide, 19:1 acrylamide:bis) and visualized by autoradiography. 

Mobility shift assays 

Electrophoretic mobility shift DNA binding assays were carried out using a PCR 
amplified virB promoter labeled with (a- 32 P] dCTP. PCR reactions included 20 uCi (a- 
32 P]dCTP, plasmid pSM243cd as template, Primer 1:5'-TTC CAC GGT GAC GCA TCG 
AAT G-3\ and Primer 2:5'-CCC CGA TCT CTT AAA CAT ACC TTA TCT CC-3\ 
Unincorporated nucleotides were removed using the Wizard™ PCR Preps DNA 
Purification System kit from Promega. Mobility shift reaction mixtures contained 1600 
cpm 32 P-labeled virB promoter, 50 mM KC1, 20 mM Tris-HCl pH 7.0, 10 mM MgCl 2 , 1 
mM DTT, 10% Glycerol and 20 pg/ml Herring sperm DNA. Where indicated, 1.5 uM 
VirG was added, which was sufficient to saturate all VirG binding sites on the promoter, 
and His-RpoA was added at 150-500 nM final concentration. Reactions were incubated for 
30 minutes at 22°C, loaded onto 6% polyacrtlamide, 10% glycerol vertical slab gels in 
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0.5X TBE buffer (46) and electrophoresed at 20V/cm at 4°C for 2 hours. Following 
electrophoresis the gels were dried and autoradiographed overnight at room temperature. 
Identification of an A. tumefaciens gene that is required for vir gene expr ession in E. coli 
To reconstitute vir gene induction in E. coli, plasmid construct pGP 159 containing 
5 vir A and virG under the control of their native promoters and a lacZ reporter gene fused 
downstream of the virB promoter (virBp::lacZ fusion) was introduced into E. coli strain 
MC4100. The resulting strain failed to activate transcription of the fusion in response to 
acetosyringone (AS). The inability of pGP159 to activate transcription may have been due 
to a lack of expression of the yirA/virG genes, which are under the control of their native 

10 promoters. To address this possibility, the plasmid pSL107 was utilized, which contains 
vir A and virC under the control of the lac promoter and virBpdacZ. Introduction of 
pSL107 into MC4100 also failed to activate expression of the vir gene fusion in the 
presence of AS and IPTG. The observed lack of expression was not due to a lack at 
expression of virA and/or virG as both proteins were present at detectable levels by Western 

15 blot (data not shown). Since VirA and VirG were present, the VirA/VirG signal 

transduction mechanism may not be functional in E. coli. This possibility was addressed 
through the use of pSY215 which contains a constitutively active virG qon under the control 
of the lac promoter and vtrBp::lacZ. The virG" 0 " is able to activate expression of the 
virBp::lacZ fusion in A. tumefaciens strains independent of virA and acetosyringone (17, 

20 31, 45). When pSY215 was introduced into MC4100, once again no expression of the 

virBprAacZ fusion was observed. This suggested that additional genes from A. tumefaciens 
may be required or that E. coli may contain specific repressors of vir gene expression. 

To determine if additional genes from A. tumefaciens were required, a cosmid 
library constructed from chromosomal DN A of a Ti plasmidless strain A136, was 

25 introduced into MC4100(pSY215). Screening of the resulting transformants revealed the 
presence of a clone which produced a light blue color on colonies grown on induction 
media containing X-Gal and IPTG. This cosmid clone, designated pBK2, contains a 25 Kb 
DNA insert. In order to identify the gene residing in pBK2 required for expression of 
virBp::lacZ f overlapping subclones of pBK2 were generated which were introduced into 

30 MC4100(pSY215). Expression of the fusion was detected following introduction of 

pBX4.1, but not pBKS7.0 or pBKE4.8 (Fig. 1 & Table 2). The inability of pBKS7.0 and 
pBKE4.8 to activate expression of the fusion indicated that the 0.55 Kb region between 
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these two subclones is required for expression of the fusion. Given this information, a 1 .3 
Kb Stul-PstI DNA fragment from pBX4.1 was isolated and subcloned it into pTZ18R and 
P TZ19R yielding pPS1.3R and. PPSI.3, respectively. This resulted in two constructs in 
which the lac promoter on the vector drives transcription from either end of the fragment. 
5 Expression of the vir fusion was detected only with pPS1.3. indicating the absence of 
promoter elements in the fragment and the responsible gene is in the direction of PstI to 
Srul. Subclone pPS1.3 resulted in the highest level of expression of the fusion, almost 15 
fold higher than pBK2, and 40 fold higher than the vector control. This could be due to 
both increased copy number of the gene and increased gene expression by the strong lac 

10 promoter (Table 2). 

rhararterization of the identified gene 

The DNA sequence of the entire 1 .3 Kb Stul-PstI fragment was obtained from both 
strands. Analysis of the DNA sequence revealed an open reading frame of 1,008 base 
pairs, in the predicted transcriptional direction, sufficient to encode a polypeptide of 336 
15 amino acids as shown in Figure 2A. Nucleotide and protein searches of GenBlank and 
Swiss-Prot databases indicated a high degree of similarity to rpoA genes, encoding the a 
subunit of RNAP. Comparison with rpoA from E. coli indicated 62.15% sequence 
similarity and 51.4% sequence identity at the amino acid level as shown in Figure 3. Three 
highly conserved regions can be identified from the distribution of homologous amino acid 
20 residues between E. coli and A. tumefaciens rpoA homologues. One region extends from 
residue 30 to 51 near the N-terminus (20 of 22 identical), with the other two present in the 
C-terminal domain extending from residue 256 to 270 (13 of 15 identical) and from residue 
276 to 315 (30 of 42 identical). A notable difference is the presence of an additional eight 
residues at the C-terminus of the A. tumefaciens RpoA, compared to RpoA of E. coli. A 
25 potential Shine-Dalgamo sequence, GAAGGT, was found extending from -7 to -12 bp 

upstream of the proposed ATG initiation codon of rpoA. Analysis of partial DNA sequence 
obtained from pBKS7.0 and pBKE4.8 indicated the presence of regions upstream and 
downstream of rpoA with a high degree of sequence similarity to rpsK and rplQ, encoding 
Sll and L17 ribosomal proteins, respectively, possibly forming an operon structure similar 
30 to E. coli. 

Confirmation of the ORF was achieved by construction of a His-tagged rpoA fusion 
as described above, which yielded a polypeptide of the predicted size C 37 Kd) that is 
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slightly larger than E. coli RpoA (Fig. 4A). Furthermore, the DNA clone from A 
tumefaciens was able to complement a temperature sensitive rpoA mutant of E. coli, 
HN317, proving that it does encode a functional homologue of the RNAP a subunit. 
In vitro transcription of the vi rB promoter 
5 In vitro assays involving the rpoA subclones and a constitutive virG° on , in vitro 

transcription assays were performed. Purified individual components of E. coli RNAP, p 
P' and a 70 , were mixed with either His-RpoA of E.coli or A. tumefaciens and high 
molecular weight RNAP complexes were purified by sizing column (see Materials and 
Methods). As shown in Figure 4 A, both RpoA molecules were able to successfully 

10 assemble with the E. coli P P' and a 70 subunits into complete RNAP holoenzymes. When 
tested for in vitro transcription, both RNAP holoenzymes were equally efficient in initiating 
transcription from a a 70 dependent E. coli ga/Pl promoter (Fig. 4B), demonstrating that the 
hybrid RNAP containing RpoA (a-subunit of RNAP) of A. tumefaciens is a functional 
enzyme. Furthermore, no significant differences were detected in the amount of the 

15 transcript produced in the presence or absence ofvirG 0 ™. When the virB promoter was 
used as a template, the E. coli KNAP could activate low level transcription but no 
difference was evident with or without the VirG con protein. In contrast, the hybrid RNAP 
containing A. tumefaciens RpoA was able to activate transcription from the virB promoter 
at low levels, and addition of the VirG 00 " increased transcription by 4-5 fold as measured by 

20 quantification of the gel (Fig. 4B). These results confirm the in vivo assays demonstrating 
that only RNAP containing RpoA of A. tumefaiens is able to efficiently initiate transcription 
from the virB gene promoter in a VirG con dependent manner. The inability of E. coli 
RNAP to efficiently express the vir fusion even in the presence of the VirG con suggests that 
E. coli RpoA can not make the required contacts with the VirG"" 1 protein. 

25 Specific interaction between the VirG 00 " and RpoA of A. tumefaciens 

Since the C-terminal domain of the E. coli RpoA was known to interact with the 
A+T rich "UP element of certain promoters (Ross 1993), the E. coli and A. tumefaciens 
RpoA proteins were tested to determine if these proteins have different affinities for the 
virB promoter. As shown in Figure 5 (lanes 3, 4 and 5), A. tumefaciens RpoA was able to 

30 shift the mobility of the labeled virB promoter at the highest concentration used (600nM), 
whereas the same concentration of E. coli RpoA did not, suggesting that the RpoA of A. 
tumefaciens has a higher affinity for the virB promoter than RpoA of £. coli. Similar 
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mobility shift assays were carried out to determine if His-RpoA of E. coli and A 
tumefaciens interact differently with VirG"" 1 at the virB promoter. Increasing amounts of 
RpoA were used in. combination with a saturating quantity of VirG con for the virB promoter 
(Fig. 5, lane 2) The concentration of VirG con used was determined through separate 
5 mobility shift assays in which increasing amounts of VirG con resulted in two separate shifts 
in mobility, corresponding to binding of VirG con at the two vir boxes of the virB promoter 
(data not shown). At the three concentrations of E. coli RpoA used, there was no 
additional shift in the mobility of the promoter VirG con complex (Fig. 5A, lanes 6. 7 and 8). 
However, when RpoA from A. tumefaciens was used, two separate shifts were observed 

10 (Fig 5B, lanes 6, 7 and 8) which suggests specific interactions between VirG con and RpoA 
from A. tumefaciens. These results provide further evidence suggesting that RpoA from A. 
tumefaciens may possess a higher affinity for VirG 00 " compared to RpoA from E. coli. 
Combination of virA/G and rpoA of A. tumefaciens is insufficient to reconstitute 
acetosvringone mediated vir gene induction in E. coli 

15 To determine if the signaling mechanism, resulting in vir gene activation, can be 

reconstituted in E. coli, a wild type vir A and virG was used in combination with A. 
tumefaciens rpoA. M04100 harboring two plasmid constructs, one containing a 
virBp::lacZ fusion as well as wild type virA/virG under their native promoters (pGP159), 
and. the other containing a lac-driven A. tumefaciens rpoA gene (pH098), did not show any 

20 significant increase in p-galactosidase activity in the presence of acetosyringone (data not 
shown). The possibility that lack of expression of vir A and/or virG may account for this 
result was again addressed through the use of pSL107 which contains lac-driven virA/virG 
and virBp::lacZ. However, it was not possible to obtain significant expression of the fusion 
in MC4100 harboring pSL107 and pH098. Introduction of pH098 into MC4100(pLG2), 

25 containing lac-driven VirG 10 " and virBp::lacZ, however, resulted in a significant increase in 
P-galactosidase activity, demonstrating that pH098 is able to produce a functional RpoA 
protein. These results suggest that the signal transduction mechanism of VirA/VirG may 
not be functional in E. coli. 

The present invention discloses that it is possible to obtain the expression of 

30 heterologous genes that previously could not have been expressed in a host, such as E. coli, 
by including a rpoA gene or a portion thereof from the same host genus as the expressed 
gene is obtained The ability to use E. coli as a heterologous system provides investigators 
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with a valuable tool for studying these various processes. The identification of a 
chromosomally encoded A. tumefaciens rpoA gene and the demonstration that it constitutes 
one of the components required for expression of a virBp::lacZ gene fusion in a 
heterologous E. coli background supports the present invention. The rpoA gene of E. coli 
5 has been extensively studied, particularly with regard to interactions with transcriptional 
regulators, and suggests that interaction between VirG and RpoA may be required for 
efficient transcription of virulence genes. 

The inability of pGP159 or pSL107 to activate transcription of the virBp::lacZ 
fusion suggested that either signal transduction between VirA and VirG was not functional, 

10 or that additional gene(s) were required from A. tumefaciens for activation. The use of 
pSY215 containing a virG con mutant allowed us to evaluate vir gene expression in a vir A- 
independent manner, eliminating the need for signal transduction. The lack of expression 
obtained with pSY215 in E. coli, combined with its ability to function in a Ti plasmidless 
A. tumefaciens strain suggested that additional/!, tumefaciens gene(s) were required for 

15 expression. The introduction of pPS1.3 containing lac driven rpoA into MC4100(pST215) 
resulted in a significant increase (40 fold) in transcription of the virBp::lacZ fusion 
compared to the control vector. Verification that rpoA is required was obtained through the 
use of subclone pPS1.3R, which did not activate expression of the fusion. This construct is 
identical to pPSL3, but the direction of transcription of rpoA is opposite to that of the lac 

20 promoter. The observation that A. tumefaciens rpoA was able to complement a temperature 
sensitive rpoA mutant in E. coli demonstrates an ability to function at essential E. coli 
promoters. This is evident from the in vitro transcription assay where the hybrid NAP was 
equally effective as E. coli RNAP in transcribing a a 70 -dependent ga/Pl promoter (Fig. 
4B). While the expression of the virBp::lacZ fusion (Table 1) was significantly increased, 

25 the level of expression was relatively low in comparison to expression in A. tumefaciens 

(31). The relatively low expression of the virBprAacZ fusion may have been a consequence 
of the presence of RNAP containing RpoA of E. coli. In order to remove possible 
interference from E. coli RpoA, in vitro transcription assays using reconstituted RNAP 
holoenzymes containing His-RpoA from either E. coli or A. tumefaciens were carried out. 

30 Using purified E, coli p, p' and a 70 subunits, it was demonstrated that both of the His- 

RpoA were able to assemble into multi-subunit RNAP holoenzymes (Fig. 4A). The results 
of the in vitro transcription assays demonstrated that VirG con -dependent transcription of the 
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virBp::lacZ fusion requires RNAP containing A. tumefaciens RpoA, although the two 
reconstituted holoenzymes exhibited essentially identical activity in transcription from the 
ga/Pl promoter, with no significant difference in the presence or absence of VirG con (Fig 
4B). Another possible explanation for the relatively low induction in E. coli may due to the 
5 presence of £. coli sigma factors in the RNAP holoenzymes. It is conceivable that E. coli 
sigma factors have a lower affinity for the virB promoter than sigma factors from A. 
tumefaciens. Although the vegetative sigma factor from A. tumefaciens has been identified 
(50), it is unclear whether this or an alternative sigma factor is involved in vir gene 
transcription. 

10 Previous reports have identified the presence of an "UP element " in certain E. coli 

promoters which is required for optimal transcription (14, 41, 47). This element extends 
from -40 to -60 bp upstream of the transcription start site and is highly A+T rich. 
Interestingly, the virB promoter contains an A+T rich sequence from -40 to -60 that 
overlaps wit the VirG binding sites. Whether this region of the promoter constitutes a true 

15 "UP element" is unknown. From the gel shift assays, the A. tumefaciens RpoA appears to 
have a higher affinity for the virB promoter than E. coli RpoA (Fig 5 A and B), although the 
importance of this observation is unclear at this time. The results of the mobility shift 
assay suggest that E. coil RpoA is unable to bind to VirG*" 1 at the virB promoter. In 
contrasts, A. tumefaciens RpoA appears to exhibit cooperative binding with two distinct 

20 shifts in the mobility of the VirG con -virB promoter complex. Taken together, these results 
indicate that RNAP containing E. coli RpoA may be unable to interact effectively with 
VirG 00 ", and therefore can not activate transcription from the virB promoter. Since the virB 
promoter contains two binding sites for VirG, the presence of two shifts obtained with 
increasing amounts of A. tumefaciens RpoA may be a consequence of RpoA interacting 

25 with VirG at each vir box. 

The two component regulatory system, composed of vzrA and vzrG, is indispensible 
for transcription of virulence genes within A. tumefaciens. However, vzrA and virG are 
insufficient to activate transcrition from virulence gene promoters within E. coli cells, 
indicating a requirement for additional A. tumefaciens gene(s). 

30 In examining vir gene expression in E. coli, attempts to reconstitute wild type 

virulence gene expression in E. coli were not successful. The use of vir A and virG under 
the control of the lac promoter means that sufficient levels of vir A and vzrG should be 
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present for signal transduction to take place. One possible explanation may be that E. coli 
is unable to correctly insert virA into the inner membrane. Alternatively, even, though 
virA may be inserted into the inner membrane correctly, dimerization of virA which is 
required for activity in. A. turnefuciens (43) may be defective. A more likely explanation 
5 may be that additional gene(s) from A. tumefaciens are required for efficient signal 
transduction. An unresolved question is the exact mechanism of sensing of phenolic 
inducers by the virAlvirG system. The two possible mechanisms involve direct binding of 
the inducer by virA, or binding by a second receptor which then interacts with virA. 
Although genetic evidence supporting direct binding of inducers by virA has been reported 

10 (36, 37), all attempts to demonstrate direct binding by virA have been unsuccessful. 

Conversely, there have been reports in which binding of phenolic compounds by proteins 
other than virA have been detected (13, 35), although there is no evidence to link these 
proteins with vir gene induction. The search for additional A. tumefaciens gene(s) involved 
in the signal transduction should be simplified by determining that virG"" 1 mediated 

15 expression of virulence genes requires RpoA from A. tumefaciens. The present invention 
provides the basis to examine vir gene expression as well as the T-DNA transfer process in 
E. coli. 

This specific example shows that for the expression of vir genes in E. coli, both 
rpoA and virG from A. tumefaciens are required for transcriptional activation of a vir 

20 promoter in E. coli. It has been determined that the rpoA gene, encoding the a subunit of 
RNAP, confers significant expression of a virBp;;lacZ fusion in E.coli in the presence of an 
active transcriptional regulator virG gene. In vitro transcription assays were conducted 
using either reconstituted E. coli RNAP or hybrid RNAP in which the a subunit was 
derived from A. tumefaciens. Both RNAPs were equally efficient in transcription from a 

25 a 70 - dependent E. coli galPl promoter, however, only the hybrid RNAP was able to 
transcribe the virB promoter in a virG-dependent manner. 

Evidence is also presented which indicates that virG interacts with rpoA from A. 
tumefaciens but not with rpoA from E. coli. This observation suggests that in order for 
successful transcription of vir genes to occur, specific interaction(s) between the A. 

30 tumefaciens a subunit of RNAP and virG is required. 

As shown in Figure 3, one of the major difference between the RpoA sequences of 
A. tumefaciens and E.coli is the presence of an extra 8 consecutive amino acids at the C- 
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terminus of A. tumefaciens RpoA (amino acids 329 to 336). To test if these C-terminal 8 
consecutive amino acids are important in mediating vir gene activation in E.coli, a plasmid 
construct was generated from pPS1.3 by deleting the DNA coding for the last 8 amino acids 
using site directed mutagenesis. The resulting plasmid pAD8 encodes 328 amino acid long 
5 A. tumefaciens RpoA, lacking the original amino acids from 329 to 336. 

The vir gene activating ability of this construct was tested by introducing pAD8 into 
E.coli strain MC4100 harboring pSY215 which contains vir A, virG and virB::lacZ fusion. 
As negative and positive controls, vector plasmid pQE31 or pPS1.3 which encodes intact 
RpoA of A. tumefaciens was introduced into MC4100(pSY215), respectively. The v/rgene 

10 activation was monitored by measuring the p-galactosidase activities. As shown in Table 3, 
pAD8 failed to activate p-galactosidase activity whereas the pPS1.3 induced a high level p- 
galactosidase activity. These results indicate that the C-terminal 8 amino acids of A. 
tumefaciens RpoA are essential for the vir gene activation in E.coli. 

These experiments also demonstrate that incorporation of a rpoA gene of a 

15 prokaryote, such as A. tumefaciens, into an expression vector will promote the expression of 
genes from the prokaryotic cells in other prokaryotic cells, such as E. coli. Expression of 
this expression vector comprising at least a portion of an A. tumefaciens gene is a useful 
gene delivery system in heterologous hosts for gene delivery and gene therapy. Further, 
incorporation of at least a portion of an rpoA gene derived from the same cells as one or 

20 more heterologous genes, into a vector containing the heterologous genes, provide 

improved systems for expressing the heterologous genes. These expression vectors which 
comprise one or more heterologous genes and at least a portion of an rpoA gene derived 
from the same cells as the heterologous genes, are useful in the production of selected 
metabolites, by expression genes for an entire metabolic pathway in a heterologous host. 

25 The following list of publications referred to in the specification are herein 

incorporated in their entirety by reference. 
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Table 1. Bacterial Strains and plasmids used in this study 



Strain or plasmid 


Genotype or phenotype 


Source 


E, coli 








DH5a 


recA endAI hsdRH supE4 gyrA96 reLAI (lacZYA- 


(56) 






argF)UJ69 (80dlacZ Ml 5) 






MC4100 


F araDl39 (argF-kac)Ul69 rpsLl50 relA fib530l 


(57) 






ptsF25deoCl 






HN317 


rpoAin^St* 


(58) 




Plasmids 








pGP159 


virA. virG& virBp: :lacZ; IncP, Ap'.TC' 


(59) 




pSM234acd 


virA & virBp: :lacZ: IncP. Km' 


(45) 




pLG2 


lacp::virG(N54D) & virBp. AacZ; IncP, Ap r Tc' 


This 


study 


pSL107 


lacp: MrG & v/>Bp: :lacZ: IncW.APTc' 


This 


study 


pPC401(N54D) 


lacp::virG(N54D) in pTZ18R.Ap' 


This 


study 


pSY215 


vz'rBp: :lacZ & lacp: :virG(N54D); IncW, Gm r 


This 


study 


pVK102 


Cosmid cloning vector; IncP, Km',Tc r 


(60) 




P TZ18R/pTZ19R 


Cloning vector; ColEl.Ap r 


USB 




pBK2 


pVKl02 with 25 Kb chromosomal insert from 


This 


Study 




A. tumefaciens strain A136, Tc 1 






pBKS2-2 


pTZ18R with 7. 1 Kb Sauhh insert from pBK2 


This 


study 


pBX4.1 


pTZ18R with 4.3 Kb BamUl-Xbal insert from pBKS2-2 


This 


study 


pBKS7.0 


pTZ18R with 4.2 Kb 5aw3A-Sa7I insert from pBKS2-2 


This 


study 


pBKE4.8 


pTZ18R with 1.7 Kb Ec6Rl-Sau2>& insert from pBKS2-2 


rp t - 

This 


study 


prol.3K 


nT7lRR with 1 ^ Vh Pvtl-^tiA insert from nBKS2-2 

Dl /. 1 nl\ W1LI1 I. J IV U I oil ulUl lllawl t 11 will pijivu^ J- 


This 


study 


pPS1.3 


pTZ19R with 1.3 Kb Pstl-Stul insert from pBKS2-2 


This 


study 


pZL-2 


Overproducer of A. tumefaciens His-RpoA: Ap l 


This 


study 


pECH4 


Overproducer of E. coli His-RpoA.Ap' 


This 


study 


pH098 


lacpy.rpoA of A. tumefaciens frompPS1.3: IncW. Km' 


This 


study 
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Table 2. RpoA mediated expression of vz>BP::IacZ fusion in £. coli MC4100 
containing virG wsl 

Constructs ~" LB medium MG/L medium 

p-Galactosidase b Fold Activation' (3-Galactosidase b Fold Activation 



pBK102 


2.46 


NA 


2.35 


NA 


pBK2 


5.2 


2.11 X 


6.56 


2.79 X 


pTZ18R 


2.24 


NA 


2.51 


NA 


pBKS2-2 


13.67 


6.1 X 


7.25 


2.89 X 


pBX4.1 


26.42 


11. 8X 


8.11 


3.23 X 


pBKS7.0 


1.68 


0.75 X 


2.46 


0.98 X 


pBKE4.8 


2.36 


1.05 X 


1.63 


0.65 X 


pBP3.0 


5.45 


2.43 X 


2.83 


1.13 X 


pPS1.3R 


2.13 


0.95 X 


1.71 


0.68 X 


pPS1.3 


76.14 


40.0 X 


37.69 


15.0 X 



The virBpwlacZ expression assays were carried out in LB and MG/L medium for 16 
hours as described in the text, IPTG was added at a final concentration of 1 mM. E. coli 
MC4100 containing pSY215 (virGT 11 + vz>5p::lacZ). b Values are the average of three 
replicates. 

c Values indicate an increase or decrease in activity compared to the appropriate 
control vector. N A, Not Applicable. 



Table 3. (3-galactosidase activities 



E.coli strain MC4100(pSY215) 
plus plasmid shown in right 


pPS1.3 


pAD8 


pQE31 


(3-galactosidase in 
Miller Units* 


140 (+/-25) 


4( + /-l) 


3 { + 1-2) 



* Miller Units = Defined in Ref. 68. 
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We claim : 

1 . An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium. 

2. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium. 

3. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is depicted in 
Figure 2A. 

4. An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is 
depicted in Figure 2A. 

5. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises at least amino acid residues 157 to 336. 

6. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises less than amino acid residues 157 to 336. 

7. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises at least 8 consecutive amino acid residues in 
length. 

8. The nucleic acid molecule of claim 7, wherein said portion of the amino acid 
sequence as depicted in Figure 2A comprises amino acid residues 329 to 336. 

9. . An isolated nucleic acid molecule comprising the nucleic acid sequence as 
depicted in Figure 2B. 

10. An isolated nucleic acid molecule comprising at least a portion of the nucleic 
acid sequence as depicted in Figure 2B. 

11. A hybrid nucleic acid molecule for the expression of at least one 
heterologous gene in a host cell, wherein said hybrid nucleic acid molecule comprises a 
first nucleic acid sequence encoding at least a portion of the a subunit of an RNAP obtained 
from the same genus as the host cell and a second nucleic acid sequence encoding at least a 
portion of the a subunit of an RNAP obtained from the same genus as the heterologous 
gene. 

12. The isolated nucleic acid molecule of claim 11, wherein said second nucleic 
acid sequence is obtained from a prokaryotic cell selected from the group consisting of 
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Gram-negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic 
bacteria, anaerobic bacteria, and facultative bacteria. 

13 The isolated nucleic acid molecule of claim 12, wherein said second nucleic 
acid sequence is obtained from the genus Agrobacterium, Mycobacterium, Chlamydia, 

Pseudomonas or Streptomyces . 

14. The nucleic acid molecule of claim 13, wherein said second nucleic acid 

sequence is obtained from Agrobacterium. 

15. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes the amino acid sequence as depicted in Figure 2A. 

16. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes at least a portion of the amino acid sequence as depicted in Figure 2A. 

17. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises at least amino acid residues 157 to 336. 

18. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises less than amino acid residues 157 to 336. 

19 . The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises at least 8 consecutive amino acid residues 
in length. 

20. The nucleic acid molecule of claim 19, wherein said portion of the amino 
acid sequence as depicted in Figure 2A comprises amino acid residues 329 to 336. 

21. The hybrid nucleic acid molecule of claim 1 1 , wherein said first nucleic acid 
sequence is obtained from a prokaryotic cell selected from the group consisting of Gram- 
negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

22. The nucleic acid molecule of claim 21 , wherein said first nucleic acid 
sequence is obtained from the genus Escherichia, Pseudomonas, Bacillus, Lactobacillus, 
Clostridium or Rhizobium. 

23 . A nucleic acid construct comprising a nucleic acid molecule of any one of 
claims 1-22 operably linked to a promoter. 

24. A vector comprising the nucleic acid construct of claim 23. 
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25 . A host cell comprising a nucleic acid construct of claim 23 or a vector of 
claim 24, wherein said promoter is operable in said host cell. 

26. A host cell of claim 25, further comprising at least one heterolgous gene 

operably linked to a promoter. 

27. A host cell of claim 26, wherein said heterolgous gene and promoter are 

contained in a vector. 

28. A host cell of any one of claims 25-27, wherein said host cell is a 

prokaryotic cell or a eukaryotice cell. 

29. A host cell of claim 28, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

30. A host cell of claim 29, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

31. A method of expressing at least one heterologous gene in a host cell 

comprising: 

transforming said host cell with a nucleic acid molecule of 23 and with at least one 
heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

32. The method of claim 3 1 , further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

33 . The method of claim 32, wherein said transcirptional regulator is a 
transcriptional activator that interacts with the a subunit of the RNAP to enhance the 

expression of said heterologous gene. 

34. The method of any one of claims 31-33, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 

3-5. The method of claim 31, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

36. The method of claim 31, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 
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37. The method of claim 31, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

38. The method of claim 37, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fact 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

39. The method of claim 38, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

40. The method of claim 39, wherein said prokaryotic cell is an Escherichia. 

41 . The method of claim 3 1 , wherein said genus of the source of the heterolgous 
gene is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, 
Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria. 

42. The method of claim 41, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

43. The method of claim 42, wherein said genus is obtained from 
Agrobacterium. 

44. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming said host cell with a nucleic acid molecule encoding a complete a 
subunit of an RNA polymerase obtained from the same genus as the heterologous gene or a 
portion of said a subunit of the RNA polymerase operably linked to a promoter, and with 
at least one heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

45. The method of claim 44, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

'46. The method of claim 45, wherein said transcirptional regulator is a 
transcriptional activator that interacts with said a subunit of the RNAP to enhance the 
expression of said heterologous gene. 

47. The method of any one of claims 44-46, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 
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48. The method of claim 44, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

49. The method of claim 44, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

50. The method of claim 44, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

51 . The method of claim 50, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, aicd-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

52. The method of claim 51, wherein said host cell is from the genus 
Escherichia, Pseudomonas , Bacillus, Lactobacillus, Clostridium or Rhizobium. 

53. The method of claim 52, wherein said prokaryotic cell is an Escherichia, 

54. The method of claim 44, wherein said genus of the source of the heterolgous 
gene is a prokaryotic cell selected from the group consisting of Gram-negative bacteria, 
Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, anaerobic 
bacteria, and facultative bacteria. 

55. The method of claim 54, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces . 

56. A purified a subunit of an RNA polymerase of Agrobacterium or a portion 
thereof. 

57. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
a portion thereof wherein the amino acid sequence of said a subunit is depicted in Figure 

2 A or a portion thereof. 

58. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
portion thereof, wherein the a subunit of RNA polymerase is encoded by the nucleic acid is 
depicted in Figure 2B or a portion thereof. 
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AMENDED CLAIMS 

[received by the International Bureau on 15 August 2000 (15.08.00); 
original claims 3-10, 15-20, 28, 34, 57 and 58 amended; 
remaining claims unchanged (6 pages)] 

1 . An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium. 

2. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium. 

3. An isolated nucleic acid molecule encoding the complete a subunit of an 
RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is depicted in 

Figure 2 A (SEQ ID NO:l). 

4. An isolated nucleic acid molecule encoding at least a portion of the a subunit 
of an RNAP of Agrobacterium, wherein the amino acid sequence of said a subunit is 
depicted in Figure 2 A (SEQ ID NO: 1). 

5. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises at least amino acid residues 
157 to 336. 

6. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A (SEQ ID NO:l) comprises less than amino acid residues 
157 to 336. 

7. The nucleic acid molecule of claim 4, wherein said portion of the amino acid 
sequence as depicted in Figure 2A (SEQ ID NO:l) comprises at least 8 consecutive amino 
acid residues in length. 

8. The nucleic acid molecule of claim 7, wherein said portion of the amino acid 
sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises amino acid residues 329 to 
336. 

9. An isolated nucleic acid molecule comprising the nucleic acid sequence as 
depicted in Figure 2B (SEQ ID NO: 2). 

10. An isolated nucleic acid molecule comprising at least a portion of the nucleic 
acid sequence as depicted in Figure 2B (SEQ ID NO: 2). 

11. A hybrid nucleic acid molecule for the expression of at least one 
heterologous gene in a host cell, wherein said hybrid nucleic acid molecule comprises a 
first nucleic acid sequence encoding at least a portion of the a subunit of an RNAP obtained 
from the same genus as the host cell and a second nucleic acid sequence encoding at least a 
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portion of the a subunit of an RNAP obtained from the same genus as the heterologous 
gene. 

12. The isolated nucleic acid molecule of claim 11, wherein said second nucleic 
acid sequence is obtained from a prokaryotic cell selected from the group consisting of 
Gram-negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic 
bacteria, anaerobic bacteria, and facultative bacteria. 

13 The isolated nucleic acid molecule of claim 12, wherein said second nucleic 
acid sequence is obtained from the genus Agrobacterium, Mycobacterium, Chlamydia, 
Pseudomonas or Streptomyces . 

14. The nucleic acid molecule of claim 13, wherein said second nucleic acid 
sequence is obtained from Agrobacterium. 

15. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes the amino acid sequence as depicted in Figure 2 A (SEQ ID NO:l). 

16. The nucleic acid molecule of claim 14, wherein said second nucleic acid 
sequence encodes at least a portion of the amino acid sequence as depicted in Figure 2A 
(SEQ ID NO: 1). 

17. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A (SEQ ID NO: 1) comprises at least amino acid 
residues 157 to 336. 

18. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises less than amino acid 
residues 157 to 336. 

19. The nucleic acid molecule of claim 16, wherein said portion of the amino 
acid sequence as depicted in Figure 2 A (SEQ ID NO:l) comprises at least 8 consecutive 
amino acid residues in length. 

20. The nucleic acid molecule of claim 19, wherein said portion of the amino 
acid sequence as depicted in Figure 2A (SEQ ID NO:l) comprises amino acid residues 329 
to 336. - 

21: The hybrid nucleic acid molecule of claim 11, wherein said first nucleic acid 
sequence is obtained from a prokaryotic cell selected from the group consisting of Gram- 
negative bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 

anaerobic bacteria, and facultative bacteria. 
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22. The nucleic acid molecule of claim 21 , wherein said first nucleic acid 
sequence is obtained from the genus Escherichia, Pseudomonas , Bacillus, Lactobacillus, 
Clostridium or Rhizobium. 

23. A nucleic acid construct comprising a nucleic acid molecule of any one of 
claims 1-22 operably linked to a promoter. 

24. A vector comprising the nucleic acid construct of claim 23. 

25. A host cell comprising a nucleic acid construct of claim 23, wherein said 
promoter is operable in said host cell. 

26. A host cell, of claim 25, further comprising at least one heterologous gene 
operably linked to a promoter. 

27. A host cell of claim 26, wherein said heterologous gene and promoter are 
contained in a vector. 

28. A host cell of claim 26, wherein said host cell is a prokaryotic cell or a 
eukaryouc cell. 

29. A host cell of claim 28, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

30. A host cell of claim 29, wherein said host cell is an Escherichia, 
Pseudomonas, Bacillus, Lactobacillus, Clostridium or Rhizobium. 

31. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming a host cell with a nucleic acid molecule of 23 and with at least one 
heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 

32. The method of claim 31, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

33. The method of claim 32, wherein said transcriptional regulator is a 
transcriptional activator that interacts with the a subunit of the RNAP to enhance the 
expression of said heterologous gene. 
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34. The method of claim 31. wherein said heterologous gene comprises multiple 
genes or operdns in the same metabolic pathway. 

35. The method of claim 31 , wherein said nucleic acid molecule and at least one 
of said heterologous genes are contained in at least one vector. 

36. The method of claim 31, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

37. The method of claim 31, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

38. The method of claim 37, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fact 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

39. The method of claim 38, wherein said host cell is an. Escherichia, 
Pseudomonas , Bacillus, Lactobacillus, Clostridium or Rhizobium. 

40. The method of claim 39, wherein said prokaryotic cell is an Escherichia. 

41 . The method of claim 31 , wherein said genus of the source of the 
heterologous gene is a prokaryotic cell selected from the group consisting of Gram-negative 
bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

42. The method of claim 41, wherein said genus is Agrobacterium, 
Mycobacterium % Chlamydia, Pseudomonas or Streptomyces . 

43. The method of claim 42, wherein said genus is obtained from 
Agrobacterium. 

44. A method of expressing at least one heterologous gene in a host cell 
comprising: 

transforming said host cell with a nucleic acid molecule encoding a complete a 
subunit of an RNA polymerase obtained from the same genus as the heterologous gene or a 
portion of said a subunit of the RNA polymerase operably linked to a promoter, and with 
at least one heterologous gene operably linked to a promoter; and 

culturing said transformed host cell under conditions wherein at least one of said 
heterologous genes is expressed in said host cell. 
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45. The method of claim 44, further comprising transforming said host cell with 
at least one gene encoding a heterologous transcriptional regulator obtained from the same 
genus as the source of the heterologous gene. 

46. The method of claim 45, wherein said transcriptional regulator is a 
transcriptional activator that interacts with said a subunit of the RNAP to enhance the 
expression of said heterologous gene. 

47. The method of any one of claims 44-46, wherein said heterologous gene 
comprises multiple genes or operons in the same metabolic pathway. 

48. The method of claim 44, wherein said nucleic acid molecule and said at least 
one heterologous gene are contained in at least one vector. 

49. The method of claim 44, wherein said host cell is transformed with a vector 
comprising the nucleic acid molecule and with at least one vector comprising at least one 
heterologous gene. 

50. The method of claim 44, wherein said host cell is a prokaryotic cell or a 
eukaryotic cell. 

51 . The method of claim 50, wherein said host cell is a prokaryotic cell selected 
from the group consisting of Gram-negative bacteria, Gram-positive bacteria, acid-fast 
bacteria, mycoplasma, aerobic bacteria, anaerobic bacteria, and facultative bacteria. 

52. The method of claim 51, wherein said host cell is from the genus 
Escherichia, Pseudomonas , Bacillus, Lactobacillus, Clostridium or Rhizobium. 

53. The method of claim 52, wherein said prokaryotic cell is an Escherichia. 

54. The method of claim 44, wherein said genus of the source of the 
heterologous gene is a prokaryotic cell selected from the group consisting of Gram-negative 
bacteria, Gram-positive bacteria, acid-fast bacteria, mycoplasma, aerobic bacteria, 
anaerobic bacteria, and facultative bacteria. 

55. The method of claim 54, wherein said genus is Agrobacterium, 
Mycobacterium, Chlamydia, Pseudomonas or Streptomyces. 

56. A purified a subunit of an RNA polymerase of [Agrobacterium or a portion 
thereof. 

57. A purified a subunit of an RNA polymerase of Agrobacterium of claim 55 or 
a portion thereof wherein the amino acid sequence of said a subunit is depicted in Figure 
2A (SEQ ID NO:l) or a portion thereof. 
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58 A purified a subunit of an RNA polymerase of Agrobaaerium of claim 55 or 
portion thereof, wherein the a subunit of RNA polymerase is encoded by the nucleic acid is 
depicted in Figure 2B (SEQ ID NO:2) or a portion thereof. 
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FIG. 2B 

1 aacacggcat gaagtcgctt gaagtcgaag tttgcggtcc gggttceggt cgtgaatcgg 
61 cacttcgcgc tctgcaggct gccggtttca tgatcacttc cattcgcgac gccgatcccg 
121 cacaacggtt gccgtccgcg caagaagcgc cgcgtctgac gcgaccgtgg ttcggaaatt 
181 ccgcctttcc ttcggtctgg cggaattttc gtgtatctgg cgtgtgcgcg tcgatttcga 
241 tcgacggact tgcgctcaag aacccactga tgaaccactg aattaggttc ctctcggtgt 
301 tttcatgctc ggtccgtcac gattggatgg tggcggcgaa cggaaggttt aaagatgatt 
361 cagaagaact ggcaggaact tatcaagccg aacaaggtcg agttcacctc gtccagccgc 
421 accaaggcaa ctctggttgc cgagccgctg gagcgtggtt tcggtcttac cctcggcaac 
481 gcgctgcgcc gcgttctgtt gtcttctctg cgtggtgccg ctgtaacggc cgtgcagatc 
541 gacggtgtcc tgcacgaatt ctcctccatc cccggcgttc gggaagatgt gacggatatc 
601 gtgctcaaca tcaaggaaat cgccatcaag atggatggtg acgattccaa gcgcatggtc 
661 gtgcgeaagc agggtccggg ttcggtaacc gctggtgaca tccagacggt tggcgacatc 
721 gagatcctga accccgacca cgtgatctgc acgctcgatg aaggcgctga aatccgcatg 
781 gaattcaccg tcaacaacgg caagggttac gtaccggctg agcgcaaccg cgcggaagat 
841 gcccctatcg gcctcattcc ggtggacagc ctctattctc cggtcaagaa agtgtcctac 
901 aaggtggaaa acacccgtga aggtcaggtt ctcgactatg acaagctgat catgacgatc 
961 gagaccaacg gttcggtttc cggcgaagac gccgttgcct tcgccgctcg cattcttcag 
1021 gaccagctgg gcgtcttcgt caacttcgac gagccgcaga aggaagcaga agaagaatcg 
1081 gttactgaac tcgcgttcaa cccggcgctt ctcaagaagg tcgacgagct cgaactgtca 
1141 gttcgttcgg caaactgcct gaagaacgac aacatcgttt atatcggcga cctgatccag 
1201 aagaccgaag ccgaaatgct ccgcacgccg aactttggtc gcaagtcgct gaacgaaatc 
1261 aaggaagttc tcgcttccat gggtctgcac ctcggcatgg aagtgccggc atggccgcct 
1321 gagaacatcg aagatctcgc aaagcgttac gaagaccaat actaacaaac aagaaggcag 
1381 accttaaaga ctgcctttcc ccgtcaaaca gcagataagt catctgcatg tgccaggaaa 
1441 cggcaggcct taaagaaggc acctgcgtag aaggagaata gcaatgcgcc acggtaattc 
1501 aggccgcaag ctcaatagaa ccgccagcca ccgcaaggca atgtttgcca acatggctgc 
1561 ttcgctcatc acccatgagc agatcgtcac cacccttccg aaggcgaagg aaatccgtcc 
1621 gatcgtcgag cgtctcgtga cgctgggcaa gcgcggcgac ctgcacgctc gtcgtcaggc 
1681 gatctcgcag at 
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